
Myopia

Last edit: 2 Oct 2020 23:31 UTC by Ben Pace

Myopia means short-sightedness, particularly with respect to planning: neglecting long-term consequences in favor of the short term. The extreme case, in which only immediate rewards are considered, is of particular interest. We can think of a myopic agent as one that considers only how best to answer the single question it is given, rather than any sort of long-term consequences. Such an agent might have a number of desirable safety properties, such as a lack of instrumental incentives.
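As a rough illustration (not drawn from any of the posts below), the myopic/non-myopic distinction can be phrased in terms of the discount factor: a fully myopic agent optimizes only its immediate reward, which corresponds to setting the discount γ to zero in the usual discounted return. The short Python sketch below is hypothetical; the actions and reward sequences are invented purely to show how the two objectives can favor different choices.

```python
# Minimal sketch (hypothetical example): a fully myopic agent picks the action
# with the best immediate reward, while a non-myopic agent values the whole
# discounted future. The toy actions and rewards here are made up for illustration.

def discounted_return(rewards, gamma):
    """Sum of gamma^t * r_t over a sequence of future rewards."""
    return sum(gamma ** t * r for t, r in enumerate(rewards))

# Each action leads to a fixed sequence of rewards over the next few steps.
action_rewards = {
    "answer_now":  [1.0, 0.0, 0.0],  # good immediately, nothing later
    "build_power": [0.2, 2.0, 2.0],  # poor now, pays off later
}

def best_action(gamma):
    return max(action_rewards,
               key=lambda a: discounted_return(action_rewards[a], gamma))

print(best_action(gamma=0.0))   # myopic agent -> "answer_now"
print(best_action(gamma=0.9))   # long-horizon agent -> "build_power"
```

With γ = 0 the comparison favors the immediately rewarding action; with γ = 0.9 it favors the action that pays off later, which is exactly the kind of long-horizon incentive that myopia is meant to rule out.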

Partial Agency

abramdemski · 27 Sep 2019 22:04 UTC
55 points
18 comments · 9 min read · LW link

The Credit Assignment Problem

abramdemski · 8 Nov 2019 2:50 UTC
77 points
40 comments · 17 min read · LW link · 1 review

Towards a mechanistic understanding of corrigibility

evhub · 22 Aug 2019 23:20 UTC
44 points
26 comments · 6 min read · LW link

Open Problems with Myopia

10 Mar 2021 18:38 UTC
49 points
16 comments · 8 min read · LW link

Defining Myopia

abramdemski · 19 Oct 2019 21:32 UTC
31 points
18 comments · 8 min read · LW link

LCDT, A Myopic Decision Theory

3 Aug 2021 22:41 UTC
49 points
50 comments · 15 min read · LW link

Arguments against myopic training

Richard_Ngo · 9 Jul 2020 16:07 UTC
55 points
39 comments · 12 min read · LW link

An overview of 11 proposals for building safe advanced AI

evhub · 29 May 2020 20:38 UTC
184 points
34 comments · 38 min read · LW link · 2 reviews

Bayesian Evolving-to-Extinction

abramdemski · 14 Feb 2020 23:55 UTC
38 points
13 comments · 5 min read · LW link

Random Thoughts on Predict-O-Matic

abramdemski · 17 Oct 2019 23:39 UTC
29 points
3 comments · 9 min read · LW link

The Parable of Predict-O-Matic

abramdemski · 15 Oct 2019 0:49 UTC
282 points
41 comments · 14 min read · LW link · 2 reviews

Self-Fulfilling Prophecies Aren’t Always About Self-Awareness

John_Maxwell · 18 Nov 2019 23:11 UTC
14 points
7 comments · 4 min read · LW link

The Dualist Predict-O-Matic ($100 prize)

John_Maxwell · 17 Oct 2019 6:45 UTC
16 points
35 comments · 5 min read · LW link

Why GPT wants to mesa-optimize & how we might change this

John_Maxwell · 19 Sep 2020 13:48 UTC
55 points
32 comments · 9 min read · LW link

2019 Review Rewrite: Seeking Power is Often Robustly Instrumental in MDPs

TurnTrout · 23 Dec 2020 17:16 UTC
35 points
0 comments · 4 min read · LW link
(www.lesswrong.com)

Seeking Power is Often Convergently Instrumental in MDPs

5 Dec 2019 2:33 UTC
146 points
35 comments · 16 min read · LW link · 2 reviews
(arxiv.org)

Understanding and controlling auto-induced distributional shift

LRudL · 13 Dec 2021 14:59 UTC
24 points
2 comments · 16 min read · LW link

Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability

Michaël Trazzi · 8 Jun 2021 19:20 UTC
28 points
0 comments · 55 min read · LW link

Fighting Akrasia: Incentivising Action

G Gordon Worley III · 29 Apr 2009 13:48 UTC
11 points
58 comments · 2 min read · LW link

Graphical World Models, Counterfactuals, and Machine Learning Agents

Koen.Holtman · 17 Feb 2021 11:07 UTC
6 points
2 comments · 10 min read · LW link

Transforming myopic optimization to ordinary optimization—Do we want to seek convergence for myopic optimization problems?

tailcalled · 11 Dec 2021 20:38 UTC
12 points
1 comment · 5 min read · LW link

How complex are myopic imitators?

Vivek Hebbar · 8 Feb 2022 12:00 UTC
23 points
1 comment · 15 min read · LW link

AI safety via market making

evhub · 26 Jun 2020 23:07 UTC
52 points
45 comments · 11 min read · LW link

Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios

Evan R. Murphy · 12 May 2022 20:01 UTC
42 points
0 comments · 59 min read · LW link