Agency

Last edit: 26 Dec 2022 6:28 UTC by Roman Leventov

Agency, or agenticness, is the property of effectively acting within an environment to achieve one’s goals. The more agentic a being is, the more reliably you can predict its actions from its goals, since it will do whatever maximizes its chances of achieving those goals. Agency is sometimes contrasted with sphexishness: the blind execution of cached algorithms without regard for effectiveness.

One might lack agency for internal reasons, e.g., being a rock that has no goals and no ability to act, or for external reasons, e.g., being a child who is granted no freedom to act as they choose.
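To make the predictability claim concrete, here is a minimal sketch in Python (a toy example with made-up actions and goals, not drawn from any of the posts below): the agentic policy picks whichever action best serves its goal, so its behavior is predictable from the goal alone, while the sphexish policy replays a cached routine that reveals nothing about what it wants.

```python
# Toy illustration: an agentic policy vs. a sphexish one.
# The action set and goal encoding are illustrative assumptions.

ACTIONS = ["gather_food", "build_shelter", "rest"]

def goal_achievement(action: str, goal: str) -> float:
    """Toy measure of how well an action advances a goal."""
    return 1.0 if action == goal else 0.0

def agentic_policy(goal: str) -> str:
    # An agent takes whatever action maximizes goal achievement,
    # so knowing its goal lets you predict its action.
    return max(ACTIONS, key=lambda a: goal_achievement(a, goal))

def sphexish_policy(goal: str) -> str:
    # A sphexish system blindly executes a cached routine,
    # so its behavior carries no information about its goal.
    return "rest"

for goal in ["gather_food", "build_shelter"]:
    print(f"goal={goal!r}: agent does {agentic_policy(goal)!r}, "
          f"sphex does {sphexish_policy(goal)!r}")
```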

See Also

Consequentialists: One-Way Pattern Traps
David Udell, 16 Jan 2023 20:48 UTC · 53 points · 3 comments · 14 min read · LW link

Being a Robust Agent
Raemon, 18 Oct 2018 7:00 UTC · 144 points · 32 comments · 7 min read · LW link · 2 reviews

Think carefully before calling RL policies “agents”
TurnTrout, 2 Jun 2023 3:46 UTC · 123 points · 28 comments · 4 min read · LW link

Optimality is the tiger, and agents are its teeth
Veedrac, 2 Apr 2022 0:46 UTC · 288 points · 37 comments · 16 min read · LW link

The Agency Overhang
Jeffrey Ladish, 21 Apr 2023 7:47 UTC · 81 points · 6 comments · 6 min read · LW link

[Link] Sarah Constantin: “Why I am Not An AI Doomer”
lbThingrb, 12 Apr 2023 1:52 UTC · 61 points · 13 comments · 1 min read · LW link
(sarahconstantin.substack.com)

Select Agent Specifications as Natural Abstractions
lukemarks, 7 Apr 2023 23:16 UTC · 18 points · 3 comments · 9 min read · LW link

Understanding Selection Theorems
adamk, 28 May 2022 1:49 UTC · 41 points · 3 comments · 7 min read · LW link

An Agent is a Worldline in Tegmark V
komponisto, 12 Jul 2018 5:12 UTC · 24 points · 12 comments · 2 min read · LW link

LLMs may capture key components of human agency
catubc, 17 Nov 2022 20:14 UTC · 26 points · 0 comments · 4 min read · LW link

Pitfalls of the agent model
Alex Flint, 27 Apr 2021 22:19 UTC · 20 points · 4 comments · 20 min read · LW link

Agency in Conway’s Game of Life
Alex Flint, 13 May 2021 1:07 UTC · 110 points · 93 comments · 9 min read · LW link · 2 reviews

Saving Time
Scott Garrabrant, 18 May 2021 20:11 UTC · 137 points · 20 comments · 4 min read · LW link · 1 review

Uncertainty can Defuse Logical Explosions
Jemist, 30 Jul 2021 12:36 UTC · 13 points · 7 comments · 3 min read · LW link

A review of “Agents and Devices”
adamShimi, 13 Aug 2021 8:42 UTC · 13 points · 0 comments · 4 min read · LW link

What’s Stopping You?
Neel Nanda, 21 Oct 2021 16:20 UTC · 39 points · 2 comments · 19 min read · LW link · 1 review
(www.neelnanda.io)

Agents as P₂B Chain Reactions
Daniel Kokotajlo, 4 Dec 2021 21:35 UTC · 18 points · 0 comments · 2 min read · LW link

Agency: What it is and why it matters
Daniel Kokotajlo, 4 Dec 2021 21:32 UTC · 25 points · 2 comments · 2 min read · LW link

You can’t understand human agency without understanding amoeba agency
shminux, 6 Jan 2022 4:42 UTC · 19 points · 36 comments · 1 min read · LW link

[Question] How to tradeoff utility and agency?
A Ray, 14 Jan 2022 1:33 UTC · 14 points · 5 comments · 1 min read · LW link

REPL’s: a type signature for agents
scottviteri, 15 Feb 2022 22:57 UTC · 25 points · 6 comments · 2 min read · LW link

Aliveness
Ziz, 18 Jan 2018 5:00 UTC · 16 points · 9 comments · 1 min read · LW link
(sinceriously.fyi)

Are You a Paralyzed Subordinate Monkey?
Eliezer Yudkowsky, 2 Mar 2011 21:12 UTC · 45 points · 78 comments · 1 min read · LW link

Agency and Sphexishness: A Second Glance
Ruby, 16 Apr 2019 1:25 UTC · 25 points · 8 comments · 2 min read · LW link

On the Nature of Agency
Ruby, 1 Apr 2019 1:32 UTC · 31 points · 24 comments · 9 min read · LW link

Mana
Ziz, 20 Dec 2017 2:24 UTC · 13 points · 18 comments · 4 min read · LW link

Characterizing Real-World Agents as a Research Meta-Strategy
johnswentworth, 8 Oct 2019 15:32 UTC · 29 points · 4 comments · 5 min read · LW link

Extenuating Circumstances
Eliezer Yudkowsky, 6 Apr 2009 22:57 UTC · 54 points · 42 comments · 4 min read · LW link

Río Grande: judgment calls
KatjaGrace, 27 Jan 2019 3:50 UTC · 25 points · 5 comments · 2 min read · LW link
(worldlypositions.tumblr.com)

Beware over-use of the agent model
Alex Flint, 25 Apr 2021 22:19 UTC · 28 points · 10 comments · 5 min read · LW link · 1 review

Agents Over Cartesian World Models
27 Apr 2021 2:06 UTC · 66 points · 4 comments · 27 min read · LW link

Gradations of Agency
Daniel Kokotajlo, 23 May 2022 1:10 UTC · 41 points · 6 comments · 5 min read · LW link

Why agents are powerful
Daniel Kokotajlo, 6 Jun 2022 1:37 UTC · 37 points · 7 comments · 7 min read · LW link

Seven ways to become unstoppably agentic
Evie Cottrell, 26 Jun 2022 17:39 UTC · 58 points · 16 comments · 9 min read · LW link
(eviecottrell.com)

Vingean Agency
abramdemski, 24 Aug 2022 20:08 UTC · 58 points · 13 comments · 3 min read · LW link

[Exploratory] Becoming more Agentic
Johannes C. Mayer, 6 Sep 2022 0:45 UTC · 6 points · 1 comment · 1 min read · LW link

[Question] Where do you find people who actually do things?
Ulisse Mini, 13 Jan 2023 6:57 UTC · 7 points · 12 comments · 1 min read · LW link

Does novel understanding imply novel agency / values?
TsviBT, 19 Feb 2023 14:41 UTC · 13 points · 0 comments · 7 min read · LW link

Instrumentality makes agents agenty
porby, 21 Feb 2023 4:28 UTC · 19 points · 4 comments · 6 min read · LW link

The Open Agency Model
Eric Drexler, 22 Feb 2023 10:35 UTC · 108 points · 18 comments · 4 min read · LW link

Power-seeking can be probable and predictive for trained agents
28 Feb 2023 21:10 UTC · 56 points · 22 comments · 9 min read · LW link
(arxiv.org)

Definitions of “objective” should be Probable and Predictive
Rohin Shah, 6 Jan 2023 15:40 UTC · 43 points · 27 comments · 12 min read · LW link

Agents synchronization
Ben Amitay, 11 Mar 2023 18:41 UTC · 12 points · 1 comment · 5 min read · LW link

Agentic GPT simulations: a risk and an opportunity
Yair Halberstadt, 22 Mar 2023 6:24 UTC · 24 points · 8 comments · 1 min read · LW link

Role Architectures: Applying LLMs to consequential tasks
Eric Drexler, 30 Mar 2023 15:00 UTC · 52 points · 7 comments · 9 min read · LW link

Ultimate ends may be easily hidable behind convergent subgoals
TsviBT, 2 Apr 2023 14:51 UTC · 57 points · 4 comments · 22 min read · LW link

Orthogonality is Expensive
DragonGod, 3 Apr 2023 0:43 UTC · 21 points · 3 comments · 1 min read · LW link
(www.beren.io)

Relative Abstracted Agency
Audere, 8 Apr 2023 16:57 UTC · 14 points · 6 comments · 5 min read · LW link

Beren’s “Deconfusing Direct vs Amortised Optimisation”
DragonGod, 7 Apr 2023 8:57 UTC · 51 points · 10 comments · 3 min read · LW link

Bringing Agency Into AGI Extinction Is Superfluous
George3d6, 8 Apr 2023 4:02 UTC · 28 points · 18 comments · 5 min read · LW link

Trying AgentGPT, an AutoGPT variant
Gunnar_Zarncke, 13 Apr 2023 10:13 UTC · 10 points · 9 comments · 1 min read · LW link

Consequentialism is in the Stars not Ourselves
DragonGod, 24 Apr 2023 0:02 UTC · 7 points · 19 comments · 5 min read · LW link

[Question] Why do we care about agency for alignment?
Chris_Leong, 23 Apr 2023 18:10 UTC · 22 points · 19 comments · 1 min read · LW link

Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense
So8res, 24 Nov 2023 17:37 UTC · 203 points · 80 comments · 5 min read · LW link

Naturalist Experimentation
LoganStrohl, 10 May 2023 4:28 UTC · 57 points · 14 comments · 10 min read · LW link

One path to coherence: conditionalization
porby, 29 Jun 2023 1:08 UTC · 28 points · 4 comments · 4 min read · LW link

Introduction to Towards Causal Foundations of Safe AGI
12 Jun 2023 17:55 UTC · 67 points · 6 comments · 4 min read · LW link

Steering subsystems: capabilities, agency, and alignment
Seth Herd, 29 Sep 2023 13:45 UTC · 21 points · 0 comments · 8 min read · LW link

Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning
Roman Leventov, 12 Jan 2023 16:43 UTC · 17 points · 2 comments · 2 min read · LW link
(arxiv.org)

They are made of repeating patterns
quetzal_rainbow, 13 Nov 2023 18:17 UTC · 45 points · 3 comments · 2 min read · LW link

‘Theories of Values’ and ‘Theories of Agents’: confusions, musings and desiderata
15 Nov 2023 16:00 UTC · 34 points · 8 comments · 24 min read · LW link

A multi-disciplinary view on AI safety research
Roman Leventov, 8 Feb 2023 16:50 UTC · 42 points · 4 comments · 26 min read · LW link

The intelligence-sentience orthogonality thesis
Ben Smith, 13 Jul 2023 6:55 UTC · 18 points · 9 comments · 9 min read · LW link

Agentic Growth
Logan Kieller, 28 Nov 2023 15:45 UTC · 8 points · 0 comments · 3 min read · LW link
(logankieller.substack.com)

A critical agential account of free will, causation, and physics
jessicata, 5 Mar 2020 7:57 UTC · 25 points · 10 comments · 12 min read · LW link
(unstableontology.com)

Forcing Freedom
vlad.proex, 6 Oct 2020 18:15 UTC · 43 points · 12 comments · 7 min read · LW link

Implied “utilities” of simulators are broad, dense, and shallow
porby, 1 Mar 2023 3:23 UTC · 43 points · 7 comments · 3 min read · LW link

Apply to the Conceptual Boundaries Workshop for AI Safety
Chipmonk, 27 Nov 2023 21:04 UTC · 42 points · 0 comments · 2 min read · LW link

A reply to Byrnes on the Free Energy Principle
Roman Leventov, 3 Mar 2023 13:03 UTC · 27 points · 16 comments · 14 min read · LW link

[Question] Does agency necessarily imply self-preservation instinct?
Mislav Jurić, 1 May 2023 16:06 UTC · 5 points · 8 comments · 1 min read · LW link

ARC tests to see if GPT-4 can escape human control; GPT-4 failed to do so
Christopher King, 15 Mar 2023 0:29 UTC · 116 points · 22 comments · 2 min read · LW link

Instantiating an agent with GPT-4 and text-davinci-003
Max H, 19 Mar 2023 23:57 UTC · 13 points · 3 comments · 32 min read · LW link

“Dirty concepts” in AI alignment discourses, and some guesses for how to deal with them
20 Aug 2023 9:13 UTC · 65 points · 3 comments · 3 min read · LW link

Does GPT-4 exhibit agency when summarizing articles?
Christopher King, 24 Mar 2023 15:49 UTC · 16 points · 2 comments · 5 min read · LW link

More experiments in GPT-4 agency: writing memos
Christopher King, 24 Mar 2023 17:51 UTC · 5 points · 2 comments · 10 min read · LW link

[Question] Is “brittle alignment” good enough?
the8thbit, 23 May 2023 17:35 UTC · 9 points · 5 comments · 3 min read · LW link

GPT-4 busted? Clear self-interest when summarizing articles about itself vs when article talks about Claude, LLaMA, or DALL·E 2
Christopher King, 31 Mar 2023 17:05 UTC · 6 points · 4 comments · 4 min read · LW link

Imagine a world where Microsoft employees used Bing
Christopher King, 31 Mar 2023 18:36 UTC · 6 points · 2 comments · 2 min read · LW link

Notes on the importance and implementation of safety-first cognitive architectures for AI
Brendon_Wong, 11 May 2023 10:03 UTC · 3 points · 0 comments · 3 min read · LW link

Towards Measures of Optimisation
12 May 2023 15:29 UTC · 53 points · 37 comments · 4 min read · LW link

Some Summaries of Agent Foundations Work
mattmacdermott, 15 May 2023 16:09 UTC · 55 points · 1 comment · 13 min read · LW link

We are misaligned: the saddening idea that most of humanity doesn’t intrinsically care about x-risk, even on a personal level
Christopher King, 19 May 2023 16:12 UTC · 3 points · 5 comments · 2 min read · LW link

AGI safety from first principles: Goals and Agency
Richard_Ngo, 29 Sep 2020 19:06 UTC · 74 points · 15 comments · 15 min read · LW link

Empirical Observations of Objective Robustness Failures
23 Jun 2021 23:23 UTC · 63 points · 5 comments · 9 min read · LW link

Non-superintelligent paperclip maximizers are normal
jessicata, 10 Oct 2023 0:29 UTC · 65 points · 4 comments · 9 min read · LW link
(unstableontology.com)

Aligning AI by optimizing for “wisdom”
27 Jun 2023 15:20 UTC · 22 points · 7 comments · 12 min read · LW link

Discussion: Objective Robustness and Inner Alignment Terminology
23 Jun 2021 23:25 UTC · 73 points · 7 comments · 9 min read · LW link

Minimum Viable Exterminator
Richard Horvath, 29 May 2023 16:32 UTC · 14 points · 5 comments · 5 min read · LW link

Gwern’s “Why Tool AIs Want to Be Agent AIs: The Power of Agency”
habryka, 5 May 2019 5:11 UTC · 26 points · 3 comments · 1 min read · LW link
(www.gwern.net)

Intent-aligned AI systems deplete human agency: the need for agency foundations research in AI safety
catubc, 31 May 2023 21:18 UTC · 17 points · 3 comments · 11 min read · LW link

Causality: A Brief Introduction
20 Jun 2023 15:01 UTC · 48 points · 18 comments · 1 min read · LW link

OpenAI introduces function calling for GPT-4
20 Jun 2023 1:58 UTC · 24 points · 3 comments · 4 min read · LW link
(openai.com)

Stop trying to have “interesting” friends
eq, 19 Apr 2023 23:39 UTC · 40 points · 15 comments · 6 min read · LW link

Direction of Fit
NicholasKees, 2 Oct 2023 12:34 UTC · 31 points · 0 comments · 3 min read · LW link

Gato’s Generalisation: Predictions and Experiments I’d Like to See
Oliver Sourbut, 18 May 2022 7:15 UTC · 43 points · 3 comments · 10 min read · LW link

The Inner Workings of Resourcefulness
Nora_Ammann, 25 Feb 2021 9:15 UTC · 22 points · 3 comments · 8 min read · LW link

Agency and Coherence
David Udell, 26 Mar 2022 19:25 UTC · 24 points · 2 comments · 3 min read · LW link

Agency from a causal perspective
30 Jun 2023 17:37 UTC · 38 points · 5 comments · 6 min read · LW link

Towards Gears-Level Understanding of Agency
Thane Ruthenis, 16 Jun 2022 22:00 UTC · 23 points · 4 comments · 18 min read · LW link

We Need To Know About Continual Learning
michael_mjd, 22 Apr 2023 17:08 UTC · 29 points · 14 comments · 4 min read · LW link

A physicist’s approach to Origins of Life
pchvykov, 28 Jun 2022 15:23 UTC · 11 points · 6 comments · 16 min read · LW link

Cultivating And Destroying Agency
hath, 30 Jun 2022 3:59 UTC · 88 points · 11 comments · 8 min read · LW link

Can we achieve AGI Alignment by balancing multiple human objectives?
Ben Smith, 3 Jul 2022 2:51 UTC · 11 points · 1 comment · 4 min read · LW link

What Environment Properties Select Agents For World-Modeling?
Thane Ruthenis, 23 Jul 2022 19:27 UTC · 24 points · 1 comment · 12 min read · LW link

Mistakes as agency
pchvykov, 25 Jul 2022 16:17 UTC · 12 points · 8 comments · 4 min read · LW link

AGI-level reasoner will appear sooner than an agent; what the humanity will do with this reasoner is critical
Roman Leventov, 30 Jul 2022 20:56 UTC · 24 points · 10 comments · 1 min read · LW link

Project proposal: Testing the IBP definition of agent
9 Aug 2022 1:09 UTC · 21 points · 4 comments · 2 min read · LW link

[Question] What is an agent in reductionist materialism?
Valentine, 13 Aug 2022 15:39 UTC · 7 points · 15 comments · 1 min read · LW link

Discovering Agents
zac_kenton, 18 Aug 2022 17:33 UTC · 71 points · 11 comments · 6 min read · LW link

“Concepts of Agency in Biology” (Okasha, 2023) - Brief Paper Summary
Nora_Ammann, 8 Jul 2023 18:22 UTC · 40 points · 3 comments · 7 min read · LW link

Agency engineering: is AI-alignment “to human intent” enough?
catubc, 2 Sep 2022 18:14 UTC · 9 points · 10 comments · 6 min read · LW link

Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
8 Nov 2023 11:37 UTC · 45 points · 0 comments · 18 min read · LW link

There are no rules
unoptimal, 23 Sep 2022 20:47 UTC · 34 points · 2 comments · 5 min read · LW link

“Agency” needs nuance
Evie Cottrell, 25 Sep 2022 7:40 UTC · 23 points · 1 comment · 14 min read · LW link

Cooperators are more powerful than agents
Ivan Vendrov, 21 Oct 2022 20:02 UTC · 19 points · 7 comments · 3 min read · LW link

Beyond Kolmogorov and Shannon
25 Oct 2022 15:13 UTC · 60 points · 16 comments · 5 min read · LW link

Interpreting systems as solving POMDPs: a step towards a formal understanding of agency [paper link]
the gears to ascension, 5 Nov 2022 1:06 UTC · 13 points · 2 comments · 1 min read · LW link
(www.semanticscholar.org)

The two conceptions of Active Inference: an intelligence architecture and a theory of agency
Roman Leventov, 16 Nov 2022 9:30 UTC · 15 points · 0 comments · 4 min read · LW link

[Question] Does Agent-like Behavior Imply Agent-like Architecture?
Scott Garrabrant, 23 Aug 2019 2:01 UTC · 56 points · 8 comments · 1 min read · LW link

AGIs may value intrinsic rewards more than extrinsic ones
catubc, 17 Nov 2022 21:49 UTC · 8 points · 6 comments · 4 min read · LW link

Sets of objectives for a multi-objective RL agent to optimize
23 Nov 2022 6:49 UTC · 11 points · 0 comments · 8 min read · LW link

Grokking the Intentional Stance
jbkjr, 31 Aug 2021 15:49 UTC · 45 points · 22 comments · 20 min read · LW link · 1 review

[Question] Will the first AGI agent have been designed as an agent (in addition to an AGI)?
nahoj, 3 Dec 2022 20:32 UTC · 1 point · 8 comments · 1 min read · LW link

MDPs and the Bellman Equation, Intuitively Explained
Jack O'Brien, 27 Dec 2022 5:50 UTC · 11 points · 3 comments · 14 min read · LW link

Properties of current AIs and some predictions of the evolution of AI from the perspective of scale-free theories of agency and regulative development
Roman Leventov, 20 Dec 2022 17:13 UTC · 33 points · 3 comments · 36 min read · LW link

How evolutionary lineages of LLMs can plan their own future and act on these plans
Roman Leventov, 25 Dec 2022 18:11 UTC · 37 points · 16 comments · 8 min read · LW link

Against Agents as an Approach to Aligned Transformative AI
DragonGod, 27 Dec 2022 0:47 UTC · 12 points · 9 comments · 2 min read · LW link

[Question] Why The Focus on Expected Utility Maximisers?
DragonGod, 27 Dec 2022 15:49 UTC · 107 points · 82 comments · 3 min read · LW link

In Defense of Wrapper-Minds
Thane Ruthenis, 28 Dec 2022 18:28 UTC · 23 points · 38 comments · 3 min read · LW link

My scorched-earth policy on New Year’s resolutions
PatrickDFarley, 29 Dec 2022 14:45 UTC · 28 points · 2 comments · 4 min read · LW link