Mild Optimization

Tag

AISC project: SatisfIA – AI that satisfies without overdoing it

Jobst Heitzig11 Nov 2023 18:22 UTC

11 points

0 comments1 min readLW link

(docs.google.com)

Aspiration-based Q-Learning

Clément Dumas and Jobst Heitzig

27 Oct 2023 14:42 UTC

37 points

5 comments11 min readLW link

AISC team report: Soft-optimization, Bayes and Goodhart

Simon Fischer, benjaminko, jazcarretao, DFNaiff and Jeremy Gillen

27 Jun 2023 6:05 UTC

37 points

2 comments15 min readLW link

Requirements for a STEM-capable AGI Value Learner (my Case for Less Doom)

RogerDearnaley25 May 2023 9:26 UTC

32 points

3 comments15 min readLW link

[Question] Why don’t quantilizers also cut off the upper end of the distribution?

Alex_Altair15 May 2023 1:40 UTC

25 points

2 comments1 min readLW link

Thinking about maximization and corrigibility

James Payor21 Apr 2023 21:22 UTC

63 points

4 comments5 min readLW link

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning

Roger Dearnaley21 Feb 2023 9:05 UTC

10 points

1 comment23 min readLW link

Validator models: A simple approach to detecting goodharting

beren20 Feb 2023 21:32 UTC

14 points

1 comment4 min readLW link

Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning

Roman Leventov12 Jan 2023 16:43 UTC

17 points

2 comments2 min readLW link

(arxiv.org)

Soft optimization makes the value target bigger

Jeremy Gillen2 Jan 2023 16:06 UTC

117 points

20 comments12 min readLW link

Exploring Mild Behaviour in Embedded Agents

Megan Kinniment27 Jun 2022 18:56 UTC

21 points

4 comments18 min readLW link

Steam

abramdemski20 Jun 2022 17:38 UTC

134 points

13 comments5 min readLW link 1 review

When to use quantilization

RyanCarey5 Feb 2019 17:17 UTC

65 points

5 comments4 min readLW link

Optimization Regularization through Time Penalty

Linda Linsefors1 Jan 2019 13:05 UTC

11 points

4 comments3 min readLW link

Stable Pointers to Value III: Recursive Quantilization

abramdemski21 Jul 2018 8:06 UTC

20 points

4 comments4 min readLW link

Quantilal control for finite MDPs

Vanessa Kosoy12 Apr 2018 9:21 UTC

14 points

0 comments13 min readLW link

Thoughts on Quantilizers

Stuart_Armstrong2 Jun 2017 16:24 UTC

2 points

0 comments2 min readLW link

Quantilizers maximize expected utility subject to a conservative cost constraint

jessicata28 Sep 2015 2:17 UTC

33 points

3 comments5 min readLW link

Satisficers want to become maximisers

Stuart_Armstrong21 Oct 2011 16:27 UTC

37 points

70 comments1 min readLW link

The Optimizer’s Curse and How to Beat It

lukeprog16 Sep 2011 2:46 UTC

97 points

84 comments3 min readLW link