Optimization

TagLast edit: Dec 30, 2024, 9:33 AM by Dakara

Optimization is any kind of process that systematically comes up with solutions that are better than the solution used before. More technically, this kind of process moves the world into a specific and unexpected set of states by searching through a large search space, hitting small and low probability targets. When this process is gradually guided by some agent into some specific state, through searching specific targets, we can say it prefers that state.

The best way to exemplify an optimization process is through a simple example: Eliezer Yudkowsky suggests natural selection is such a process. Through an implicit preference – better replicators – natural selection searches all the genetic landscape space and hit small targets: efficient mutations.

Consider the human being. We are a highly complex object with a low probability to have been created by chance—natural selection, however, over millions of years, built up the infrastructure needed to build such a functioning body. This body, as well as other organisms, had the chance (was selected) to develop because it is in itself a rather efficient replicator suitable for the environment where it came up.

Or consider the famous chessplaying computer, Deep Blue. Outside of the narrow domain of selecting moves for chess games, it can’t do anything impressive: but as a chessplayer, it was massively more effective than virtually all humans. It has a high optimization power in the chess domain but almost none in any other field. Humans or evolution, on the other hand, are more domain-general optimization processes than Deep Blue, but that doesn’t mean they’re more effective at chess specifically. (Although note in what contexts this optimization process abstraction is useful and where it fails to be useful: it’s not obvious what it would mean for “evolution” to play chess, and yet it is useful to talk about the optimization power of natural selection, or of Deep Blue.)

Measuring Optimization Power

One way to think mathematically about optimization, like evidence, is in information-theoretic bits. The optimization power is the amount of surprise we would have in the result if there were no optimization process present. Therefore we take the base-two logarithm of the reciprocal of the probability of the result. A one-in-a-million solution (a solution so good relative to your preference ordering that it would take a million random tries to find something that good or better) can be said to have log_2(1,000,000) = 19.9 bits of optimization. Compared to a random configuration of matter, any artifact you see is going to be much more optimized than this. The math describes only laws and general principles for reasoning about optimization; as with probability theory, you oftentimes can’t apply the math directly.

The ground of optimization

Alex FlintJun 20, 2020, 12:38 AM

248 points

80 comments27 min readLW link 1 review

Measuring Optimization Power

Eliezer YudkowskyOct 27, 2008, 9:44 PM

90 points

37 comments6 min readLW link

Optimization Amplifies

Scott GarrabrantJun 27, 2018, 1:51 AM

114 points

12 comments4 min readLW link

Optimization

Eliezer YudkowskySep 13, 2008, 4:00 PM

55 points

45 comments5 min readLW link

Selection vs Control

abramdemskiJun 2, 2019, 7:01 AM

175 points

26 comments11 min readLW link 2 reviews

DL towards the unaligned Recursive Self-Optimization attractor

jacob_cannellDec 18, 2021, 2:15 AM

32 points

22 comments4 min readLW link

Risks from Learned Optimization: Introduction

evhub, Chris van Merwijk, Vlad Mikulik, Joar Skalse and Scott Garrabrant

May 31, 2019, 11:44 PM

187 points

42 comments12 min readLW link 3 reviews

Aiming at the Target

Eliezer YudkowskyOct 26, 2008, 4:47 PM

40 points

40 comments5 min readLW link

Thoughts and problems with Eliezer’s measure of optimization power

Stuart_ArmstrongJun 8, 2012, 9:44 AM

36 points

24 comments5 min readLW link

Beren’s “Deconfusing Direct vs Amortised Optimisation”

DragonGodApr 7, 2023, 8:57 AM

52 points

10 comments3 min readLW link

The Optimizer’s Curse and How to Beat It

lukeprogSep 16, 2011, 2:46 AM

100 points

84 comments3 min readLW link

Optimality is the tiger, and agents are its teeth

VeedracApr 2, 2022, 12:46 AM

334 points

44 comments16 min readLW link 1 review

Bottle Caps Aren’t Optimisers

DanielFilanAug 31, 2018, 6:30 PM

100 points

23 comments3 min readLW link 1 review

(danielfilan.com)

Optimization Concepts in the Game of Life

Vika and Ramana Kumar

Oct 16, 2021, 8:51 PM

75 points

16 comments10 min readLW link

Steering systems

Max HApr 4, 2023, 12:56 AM

50 points

1 comment15 min readLW link

Towards Measures of Optimisation

mattmacdermott and Alexander Gietelink Oldenziel

May 12, 2023, 3:29 PM

53 points

37 comments4 min readLW link

Requirements for a STEM-capable AGI Value Learner (my Case for Less Doom)

RogerDearnaleyMay 25, 2023, 9:26 AM

33 points

3 comments15 min readLW link

Goodhart’s Curse and Limitations on AI Alignment

Gordon Seidoh WorleyAug 19, 2019, 7:57 AM

25 points

18 comments10 min readLW link

Utility Maximization = Description Length Minimization

johnswentworthFeb 18, 2021, 6:04 PM

216 points

50 comments6 min readLW link

The Credit Assignment Problem

abramdemskiNov 8, 2019, 2:50 AM

105 points

40 comments17 min readLW link 1 review

Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

adamShimiJul 29, 2022, 6:59 PM

75 points

3 comments16 min readLW link

Deconfusing Direct vs Amortised Optimization

berenDec 2, 2022, 11:30 AM

136 points

19 comments10 min readLW link

Defining “optimizer”

ChantielApr 17, 2021, 3:38 PM

9 points

6 comments1 min readLW link

A new definition of “optimizer”

ChantielAug 9, 2021, 1:42 PM

5 points

0 comments7 min readLW link

Quantifying General Intelligence

JasonBrownJun 17, 2022, 9:57 PM

9 points

6 comments13 min readLW link

Ngo and Yudkowsky on AI capability gains

Eliezer Yudkowsky and Richard_Ngo

Nov 18, 2021, 10:19 PM

131 points

61 comments39 min readLW link 1 review

What is optimization power, formally?

sbenthallOct 18, 2014, 6:37 PM

18 points

16 comments2 min readLW link

Difficulty classes for alignment properties

JozdienFeb 20, 2024, 9:08 AM

34 points

5 comments2 min readLW link

Applications for Deconfusing Goal-Directedness

adamShimiAug 8, 2021, 1:05 PM

38 points

3 comments5 min readLW link 1 review

Two senses of “optimizer”

Joar SkalseAug 21, 2019, 4:02 PM

35 points

41 comments3 min readLW link

Fundamental Uncertainty: Chapter 4 - Why don’t we do what we think we should?

Gordon Seidoh WorleyAug 29, 2022, 7:25 PM

15 points

6 comments13 min readLW link

In Defence of Optimizing Routine Tasks

leogaoNov 9, 2021, 5:09 AM

47 points

6 comments3 min readLW link 1 review

Search versus design

Alex FlintAug 16, 2020, 4:53 PM

109 points

40 comments36 min readLW link 1 review

Vingean Agency

abramdemskiAug 24, 2022, 8:08 PM

63 points

14 comments3 min readLW link

Consequentialism is in the Stars not Ourselves

DragonGodApr 24, 2023, 12:02 AM

7 points

19 comments5 min readLW link

Bits of Optimization Can Only Be Lost Over A Distance

johnswentworthMay 23, 2022, 6:55 PM

31 points

18 comments2 min readLW link

Gaia Network: a practical, incremental pathway to Open Agency Architecture

Roman Leventov and Rafael Kaufmann Nedal

Dec 20, 2023, 5:11 PM

22 points

8 comments16 min readLW link

Defining Optimization in a Deeper Way Part 4

J BostockJul 28, 2022, 5:02 PM

7 points

0 comments5 min readLW link

Mesa-Optimizers vs “Steered Optimizers”

Steven ByrnesJul 10, 2020, 4:49 PM

45 points

7 comments8 min readLW link

Life’s Story Continues

Eliezer YudkowskyNov 21, 2008, 11:05 PM

24 points

14 comments5 min readLW link

Mesa-Optimizers and Over-optimization Failure (Optimizing and Goodhart Effects, Clarifying Thoughts—Part 4)

DavidmanheimAug 12, 2019, 8:07 AM

15 points

3 comments4 min readLW link

Searching for Searching for Search

Rubi J. HudsonFeb 14, 2024, 11:51 PM

21 points

4 comments7 min readLW link

Draft: Detecting optimization

Alex_AltairMar 29, 2023, 8:17 PM

23 points

2 comments6 min readLW link

Fake Optimization Criteria

Eliezer YudkowskyNov 10, 2007, 12:10 AM

74 points

21 comments3 min readLW link

Meaning & Agency

abramdemskiDec 19, 2023, 10:27 PM

91 points

17 comments14 min readLW link

Defining Optimization in a Deeper Way Part 1

J BostockJul 1, 2022, 2:03 PM

7 points

0 comments2 min readLW link

Is the term mesa optimizer too narrow?

Matthew BarnettDec 14, 2019, 11:20 PM

39 points

21 comments1 min readLW link

Mathematical Measures of Optimization Power

Alex_AltairNov 24, 2012, 10:55 AM

8 points

16 comments5 min readLW link

Notes on Simplicity

David GrossDec 2, 2020, 11:14 PM

9 points

0 comments7 min readLW link

Draft: The optimization toolbox

Alex_AltairMar 28, 2023, 8:40 PM

20 points

1 comment7 min readLW link

Optimization Provenance

Adele LopezAug 23, 2019, 8:08 PM

38 points

5 comments5 min readLW link

Game Theory without Argmax [Part 2]

Cleo NardoNov 11, 2023, 4:02 PM

31 points

14 comments13 min readLW link

Fat Tails Discourage Compromise

niplavJun 17, 2024, 9:39 AM

53 points

5 comments1 min readLW link

[Question] How Many Bits Of Optimization Can One Bit Of Observation Unlock?

johnswentworthApr 26, 2023, 12:26 AM

62 points

32 comments3 min readLW link

Clarifying mesa-optimization

Marius Hobbhahn and Pierre Peigné

Mar 21, 2023, 3:53 PM

38 points

6 comments10 min readLW link

What I Learned Running Refine

adamShimiNov 24, 2022, 2:49 PM

108 points

5 comments4 min readLW link

Don’t align agents to evaluations of plans

TurnTroutNov 26, 2022, 9:16 PM

48 points

49 comments18 min readLW link

[Question] Do the Safety Properties of Powerful AI Systems Need to be Adversarially Robust? Why?

DragonGodFeb 9, 2023, 1:36 PM

22 points

42 comments2 min readLW link

Game Theory without Argmax [Part 1]

Cleo NardoNov 11, 2023, 3:59 PM

70 points

18 comments19 min readLW link

Draft: Introduction to optimization

Alex_AltairMar 26, 2023, 5:25 PM

43 points

8 comments16 min readLW link

The First World Takeover

Eliezer YudkowskyNov 19, 2008, 3:00 PM

42 points

24 comments6 min readLW link

Towards a formalization of the agent structure problem

Alex_AltairApr 29, 2024, 8:28 PM

55 points

6 comments14 min readLW link

“Normal” is the equilibrium state of past optimization processes

Alex_AltairOct 30, 2022, 7:03 PM

82 points

5 comments5 min readLW link

Measurement, Optimization, and Take-off Speed

jsteinhardtSep 10, 2021, 7:30 PM

48 points

4 comments13 min readLW link

Distributed Decisions

johnswentworthMay 29, 2022, 2:43 AM

66 points

6 comments6 min readLW link

Family-line selection optimizer

lemonhopeApr 22, 2025, 7:16 AM

2 points

0 comments1 min readLW link

Defining Optimization in a Deeper Way Part 3

J BostockJul 20, 2022, 10:06 PM

8 points

0 comments2 min readLW link

Defining Optimization in a Deeper Way Part 2

J BostockJul 11, 2022, 8:29 PM

7 points

0 comments4 min readLW link

Opportunity Cost Blackmail

adamShimiJan 2, 2023, 1:48 PM

70 points

11 comments2 min readLW link

(epistemologicalvigilance.substack.com)

Draft: Inferring minimizers

Alex_AltairApr 1, 2023, 8:20 PM

9 points

0 comments1 min readLW link

Adversarial attacks and optimal control

JanMay 22, 2022, 6:22 PM

17 points

7 comments8 min readLW link

(universalprior.substack.com)

Degrees of Freedom

sarahconstantinApr 2, 2019, 9:10 PM

103 points

31 comments11 min readLW link

(srconstantin.wordpress.com)

Degeneracies are sticky for SGD

Guillaume Corlouer and Nicolas Macé

Jun 16, 2024, 9:19 PM

56 points

1 comment16 min readLW link

Discovering Agents

zac_kentonAug 18, 2022, 5:33 PM

73 points

11 comments6 min readLW link

Runaway Optimizers in Mind Space

silentbobJul 16, 2023, 2:26 PM

16 points

0 comments12 min readLW link

Interview with Bill O’Rourke—Russian Corruption, Putin, Applied Ethics, and More

JohnGreerOct 27, 2024, 5:11 PM

3 points

0 comments6 min readLW link

Some Problems with Ordinal Optimization Frame

Mateusz BagińskiMay 6, 2024, 5:28 AM

9 points

0 comments7 min readLW link

Architecture-aware optimisation: train ImageNet and more without hyperparameters

Chris MingardApr 22, 2023, 9:50 PM

6 points

2 comments2 min readLW link

Non-resolve as Resolve

Linda LinseforsJul 10, 2018, 11:31 PM

15 points

1 comment2 min readLW link

When Can Optimization Be Done Safely?

StrivingForLegibilityDec 30, 2023, 1:24 AM

12 points

0 comments3 min readLW link

Siren worlds and the perils of over-optimised search

Stuart_ArmstrongApr 7, 2014, 11:00 AM

83 points

418 comments7 min readLW link

Could Things Be Very Different?—How Historical Inertia Might Blind Us To Optimal Solutions

James Stephen BrownSep 11, 2024, 9:53 AM

5 points

0 comments8 min readLW link

(nonzerosum.games)

Optimisation Measures: Desiderata, Impossibility, Proposals

mattmacdermott and Alexander Gietelink Oldenziel

Aug 7, 2023, 3:52 PM

36 points

9 comments1 min readLW link

Extinction Risks from AI: Invisible to Science?

VojtaKovarik, Chris van Merwijk and Ida Mattsson

Feb 21, 2024, 6:07 PM

24 points

7 comments1 min readLW link

(arxiv.org)

Safety Data Sheets for Optimization Processes

StrivingForLegibilityJan 4, 2024, 11:30 PM

15 points

1 comment4 min readLW link

Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity

zhanpeng_zhouJul 20, 2023, 5:38 PM

22 points

13 comments3 min readLW link

(openreview.net)

The Carnot Engine of Economics

StrivingForLegibilityAug 9, 2024, 3:59 PM

5 points

0 comments5 min readLW link

Visual demonstration of Optimizer’s curse

Roman MalovNov 30, 2024, 7:34 PM

25 points

3 comments7 min readLW link

Optimization Markets

StrivingForLegibilityDec 30, 2023, 1:24 AM

13 points

2 comments2 min readLW link

MONA: Managed Myopia with Approval Feedback

Seb Farquhar, David Lindner and Rohin Shah

Jan 23, 2025, 12:24 PM

80 points

29 comments9 min readLW link

Understanding Gradient Hacking

peterbarnettDec 10, 2021, 3:58 PM

41 points

5 comments30 min readLW link

Evolutions Building Evolutions: Layers of Generate and Test

plexFeb 5, 2021, 6:21 PM

12 points

1 comment6 min readLW link

I missed the crux of the alignment problem the whole time

zeshenAug 13, 2022, 10:11 AM

53 points

7 comments3 min readLW link

Perils of optimizing in social contexts

owencbJun 16, 2022, 5:40 PM

50 points

1 comment2 min readLW link

Aligning a toy model of optimization

paulfchristianoJun 28, 2019, 8:23 PM

53 points

25 comments3 min readLW link

The Gears of Argmax

StrivingForLegibilityJan 4, 2024, 11:30 PM

11 points

0 comments3 min readLW link

Is General Intelligence “Compact”?

DragonGodJul 4, 2022, 1:27 PM

27 points

6 comments22 min readLW link

The Limits of Automation

milkandcigarettesJun 23, 2022, 6:03 PM

5 points

1 comment5 min readLW link

(milkandcigarettes.com)

Breaking Down Goal-Directed Behaviour

Oliver SourbutJun 16, 2022, 6:45 PM

11 points

1 comment2 min readLW link

Accidental Optimizers

aysajanSep 22, 2021, 1:27 PM

7 points

2 comments3 min readLW link

Interpretable by Design—Constraint Sets with Disjoint Limit Points

Ronak_MehtaMay 8, 2025, 9:08 PM

23 points

0 comments9 min readLW link

(ronakrm.github.io)

Optimization happens inside the mind, not in the world

azsantoskJun 3, 2023, 9:36 PM

17 points

10 comments5 min readLW link

Plans Are Predictions, Not Optimization Targets

johnswentworthOct 20, 2022, 9:17 PM

108 points

20 comments4 min readLW link 1 review

Optimization and the Singularity

Eliezer YudkowskyJun 23, 2008, 5:55 AM

41 points

21 comments9 min readLW link

Demons in Imperfect Search

johnswentworthFeb 11, 2020, 8:25 PM

110 points

21 comments3 min readLW link

Interlude: But Who Optimizes The Optimizer?

Paul BricmanSep 23, 2022, 3:30 PM

15 points

0 comments10 min readLW link

Adam Optimizer Causes Privileged Basis in Transformer LM Residual Stream

Diego Caples and rrenaud

Sep 6, 2024, 5:55 PM

70 points

7 comments4 min readLW link

Observing Optimization

Eliezer YudkowskyNov 21, 2008, 5:39 AM

12 points

28 comments6 min readLW link

Optimization and Adequacy in Five Bullets

james.lucassenJun 6, 2022, 5:48 AM

35 points

2 comments4 min readLW link

(jlucassen.com)

Optimizing crop planting with mixed integer linear programming in Stardew Valley

hapaninApr 5, 2022, 6:42 PM

68 points

4 comments7 min readLW link

Hypothesis: gradient descent prefers general circuits

Quintin PopeFeb 8, 2022, 9:12 PM

46 points

26 comments11 min readLW link

The Human’s Role in Mesa Optimization

silentbobMay 9, 2024, 12:07 PM

5 points

0 comments2 min readLW link

One bit of observation can unlock many of optimization—but at what cost?

dr_sApr 29, 2023, 10:53 AM

42 points

4 comments5 min readLW link

(Structural) Stability of Coupled Optimizers

Paul BricmanSep 30, 2022, 11:28 AM

25 points

0 comments10 min readLW link

Thinking about maximization and corrigibility

James PayorApr 21, 2023, 9:22 PM

63 points

4 comments5 min readLW link

Bridging Expected Utility Maximization and Optimization

Daniel HerrmannAug 5, 2022, 8:18 AM

25 points

5 comments14 min readLW link

Tessellating Hills: a toy model for demons in imperfect search

DaemonicSigilFeb 20, 2020, 12:12 AM

97 points

18 comments2 min readLW link

What’s General-Purpose Search, And Why Might We Expect To See It In Trained ML Systems?

johnswentworthAug 15, 2022, 10:48 PM

156 points

18 comments10 min readLW link

Wildfire of strategicness

TsviBTJun 5, 2023, 1:59 PM

38 points

19 comments1 min readLW link

Surprising examples of non-human optimization

Jan_RzymkowskiJun 14, 2015, 5:05 PM

31 points

9 comments1 min readLW link

The Three Warnings of the Zentradi

Trevor Hill-HandNov 21, 2024, 8:28 PM

13 points

1 comment5 min readLW link

Goldilocks and the Three Optimisers

dkl9Aug 17, 2023, 6:15 PM

−10 points

0 comments5 min readLW link

(dkl9.net)

Worse Than Random

Eliezer YudkowskyNov 11, 2008, 7:01 PM

46 points

102 comments12 min readLW link

Notes on Antelligence

AurigenaMay 13, 2023, 6:38 PM

2 points

0 comments9 min readLW link

Transforming myopic optimization to ordinary optimization—Do we want to seek convergence for myopic optimization problems?

tailcalledDec 11, 2021, 8:38 PM

12 points

1 comment5 min readLW link

Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning

Roger DearnaleyFeb 21, 2023, 9:05 AM

10 points

1 comment23 min readLW link

Efficient Cross-Domain Optimization

Eliezer YudkowskyOct 28, 2008, 4:33 PM

55 points

38 comments5 min readLW link

No free lunch theorem is irrelevant

CatneeOct 4, 2022, 12:21 AM

18 points

7 comments1 min readLW link

Hedonic asymmetries

paulfchristianoJan 26, 2020, 2:10 AM

98 points

22 comments2 min readLW link

(sideways-view.com)

The slingshot helps with learning

Wilson WuOct 31, 2024, 11:18 PM

33 points

0 comments8 min readLW link

[Question] What are examples of someone doing a lot of work to find the best of something?

chanamessingerJul 27, 2023, 3:58 PM

29 points

16 comments1 min readLW link

Satisficers want to become maximisers

Stuart_ArmstrongOct 21, 2011, 4:27 PM

38 points

70 comments1 min readLW link

Don’t design agents which exploit adversarial inputs

TurnTrout and Garrett Baker

Nov 18, 2022, 1:48 AM

72 points

64 comments12 min readLW link

No comments.

Optimization

Measuring Optimization Power

Further Reading & References

See also