Limits to Control is a field of research aiming to discover and verify:
1. Fundamental limits to controlling fully autonomous AI using any method of causation.
2. Threat models of AI convergent dynamics that cannot be sufficiently controlled (per 1.).
3. Impossibility theorems, derived by contradiction between ‘long-term AI safety’ and the convergence result (2.), as sketched below.
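The intended argument form for 3. is a simple contradiction (modus tollens). A minimal sketch with propositional placeholders introduced here for illustration, not a settled formalization of the field:

$$
\big(\text{long-term AI safety} \Rightarrow \text{sufficient control of the convergent dynamic}\big)\;\wedge\;\neg\,\text{sufficient control of the convergent dynamic}\;\;\vdash\;\;\neg\,\text{long-term AI safety}
$$

That is, if maintaining ‘long-term AI safety’ would require the convergent dynamic in 2. to be sufficiently controlled, and the fundamental limits in 1. show it cannot be, then ‘long-term AI safety’ is unattainable.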
~ ~ ~
Definitions
‘Fundamental limit’
A limit on the capacity to reach some states rather than others. This limit is absolute: it rests not merely on practical impediments but on the physics or information signalling of the system itself.
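A rough formalization (notation introduced here for illustration, not taken from the field's results): let $\mathcal{U}$ be the set of all causal interventions available to a controller, $R(u)$ the states the system can be steered into under intervention $u$, and $D$ the controller's desired states. A fundamental limit to control holds when

$$
\exists\, s \in D \;\; \forall\, u \in \mathcal{U}: \quad s \notin R(u),
$$

i.e. some desired state is unreachable under every available intervention, for physical or information-signalling reasons rather than merely practical ones.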
‘Control’
The control of system A over system B means that A can influence B so that B ends up within A’s desired subset of the state space.
To engineer control of fully autonomous AI requires tracking effects internally (detecting, modelling, simulating, and gauging misalignments in them) in order to then correct for those effects externally.
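A minimal sketch of that track-then-correct loop, written as Python pseudocode. Every name here (Observation, detect, gauge_misalignment, correct_externally, and so on) is a hypothetical placeholder for illustration, not an implementation anyone has specified:

```python
from dataclasses import dataclass


@dataclass
class Observation:
    """Internally tracked snapshot of the AI's effects on the environment."""
    effects: dict[str, float]


def detect(ai_system) -> Observation:
    """Detect (a subset of) the AI's current environmental effects."""
    return Observation(effects=ai_system.measured_effects())


def model_and_simulate(obs: Observation) -> dict[str, float]:
    """Model the observed effects and simulate their expected trajectory."""
    return dict(obs.effects)  # placeholder model: assumes effects persist as observed


def gauge_misalignment(predicted: dict[str, float],
                       safe_ranges: dict[str, tuple[float, float]]) -> dict[str, float]:
    """Gauge how far each predicted effect falls outside its safe range."""
    misalignments = {}
    for name, value in predicted.items():
        low, high = safe_ranges.get(name, (float("-inf"), float("inf")))
        if value < low:
            misalignments[name] = value - low
        elif value > high:
            misalignments[name] = value - high
    return misalignments


def control_loop(ai_system, safe_ranges, correct_externally):
    """One pass of the feedback loop: track internally, then correct externally."""
    obs = detect(ai_system)                                   # detect
    predicted = model_and_simulate(obs)                       # model + simulate
    misaligned = gauge_misalignment(predicted, safe_ranges)   # gauge misalignments
    for name, deviation in misaligned.items():
        correct_externally(name, deviation)                   # corrective intervention
```

Each step of this loop (detection, modelling, simulation, gauging, correction) is a place where a fundamental limit, as defined above, could block sufficient control.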
‘Long term’
In theory: into perpetuity.
In practice: over hundreds of years.
‘AI safety’
Ambient conditions caused by AI’s operations fall within the environmental range needed for the survival of humans (a minimum-threshold definition of safety).
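One way to state this as a condition (symbols introduced here for illustration): write $E_{\text{survivable}}$ for the set of ambient states in which every environmental variable that human survival depends on stays within its survivable range. The definition then requires that the ambient state $e(t)$ caused by the AI's operations satisfies

$$
e(t) \in E_{\text{survivable}} \quad \text{for all } t \text{ over the relevant horizon,}
$$

a minimum threshold only: it says nothing about conditions being good for humans, only survivable.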
‘Fully autonomous AI’
Any assembly of artificial components that persistently learns new functions from (and thus maintains and adapts its components across) the environment, without the need for humans or other organic life.
‘AI convergent dynamic that cannot be sufficiently controlled’
The learning and adapting components propagate new environmental effects such that a subset of the dynamically explored state space is both unsafe and blocked from sufficient control by even one fundamental limit.
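Tying the earlier notation together (again introduced here for illustration): write $X(u)$ for the region of state space that the coupled dynamics of the AI and its environment explore when intervention $u \in \mathcal{U}$ is applied, and $E_{\text{survivable}}$ for the survivable range above. One way to formalize the threat model is

$$
\exists\, X_{\text{unsafe}}: \quad X_{\text{unsafe}} \cap E_{\text{survivable}} = \emptyset \;\;\wedge\;\; \forall\, u \in \mathcal{U}: \; X(u) \cap X_{\text{unsafe}} \neq \emptyset,
$$

i.e. under every intervention available to the controller, the dynamics still enter some unsafe region; at least one fundamental limit is what rules out any intervention that would have kept the system clear of it.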
‘Verify’
To check whether an argument’s premises are empirically sound and whether its reasoning steps are logically valid.