Timeless Decision Theory

Tag. Last edit: Feb 21, 2025, 9:25 PM by abramdemski

Timeless decision theory (TDT) is a decision theory developed by Eliezer Yudkowsky which, in slogan form, says that agents should decide as if they are determining the output of the abstract computation that they implement. This theory was developed in response to the view that rationality should be about winning (that is, about agents achieving their desired ends) rather than about behaving in a manner that we would intuitively label as rational. Prominent existing decision theories (including causal decision theory, or CDT) fail to choose the winning decision in some scenarios and so there is a need to develop a more successful theory.

Timeless Decision Theory has been replaced by Functional Decision Theory

In response to some of Eliezer’s writing on TDT, Wei Dai came up with Updateless Decision Theory (UDT). UDT is clearly superior to TDT in cases such as counterfactual mugging. TDT gets these problems wrong because it updates on its observations before calculating expected utility: even though it considers the consequences of its policies in the abstract, it does so only in the “current branch” (i.e., updatefully), and so it misses the positive consequences of its policy on other branches.
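The updateful/updateless contrast can be made concrete with a small calculation. The sketch below assumes the commonly cited form of counterfactual mugging, which this page does not spell out: Omega flips a fair coin, asks you for $100 on tails, and on heads pays $10,000 only if it predicts you would have paid on tails. The function names and dollar amounts are illustrative assumptions, not part of any formal statement of TDT or UDT.

```python
def policy_value(pays_when_asked: bool,
                 p_heads: float = 0.5,
                 reward: int = 10_000,
                 cost: int = 100) -> float:
    """Value of a policy judged before the coin flip (updateless view).

    All numbers are assumed for illustration.
    """
    # Heads branch: Omega rewards only agents whose policy is to pay on tails.
    heads_outcome = reward if pays_when_asked else 0
    # Tails branch: the agent is asked for money and pays or refuses.
    tails_outcome = -cost if pays_when_asked else 0
    return p_heads * heads_outcome + (1 - p_heads) * tails_outcome


def updated_value(pays_when_asked: bool, cost: int = 100) -> int:
    """Value of the same choice after updating on having seen tails.

    Only the current branch is counted, so the heads reward drops out.
    """
    return -cost if pays_when_asked else 0


if __name__ == "__main__":
    print("updateless:", policy_value(True), "vs", policy_value(False))
    print("updateful: ", updated_value(True), "vs", updated_value(False))
```

Under these assumed numbers, the policy of paying is worth 4950 in expectation before the flip, against 0 for refusing, while an agent that has already updated on tails sees paying as a pure loss of 100. That gap is the sense in which an updateful theory misses the consequences of its policy on other branches.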

Functional decision theory (FDT) was an attempt to write up the motivation behind both TDT and UDT in a more general way, which would ideally have created an umbrella term for decision theories sharing a flavor with UDT and TDT. To this end, the coauthors of the FDT paper attempted to include Wei Dai as a coauthor and get his approval of the general write-up as representing the spirit of UDT. However, the direction of the paper ended up heavily incorporating intuitions from causal decision theory (CDT), describing FDT as a shift from physical causality to logical causality, so that abstract mathematical nodes such as (critically) the output of the decision procedure could be included in the causal picture, and understood as exercising causal influence, even over physical circumstances.

Wei Dai intended UDT to be much closer to evidential decision theory (EDT) and further from CDT in spirit, and as such, declined to co-author the paper.

The FDT paper thus describes a general framework which remains agnostic about an updateless approach (like UDT) vs an updateful one (like TDT), but which sticks close to the logical-causality approach introduced by TDT. As such, it can be regarded as a successor to TDT (because it backs off from the fundamental mistake of TDT, namely, its updatefulness, while sticking to the core logical-causality intuition of TDT).

TDT and Newcomb’s problem

A better sense of the motivations behind, and form of, TDT can be gained by considering a particular decision scenario: Newcomb’s problem. In Newcomb’s problem, a superintelligent artificial intelligence, Omega, presents you with a transparent box and an opaque box. The transparent box contains $1000 while the opaque box contains either $1,000,000 or nothing. You are given the choice to either take both boxes (called two-boxing) or just the opaque box (one-boxing). However, things are complicated by the fact that Omega is an almost perfect predictor of human behavior and has filled the opaque box as follows: if Omega predicted that you would one-box, it filled the box with $1,000,000 whereas if Omega predicted that you would two-box it filled it with nothing.

Many people find it intuitive that it is rational to two-box in this case. As the opaque box is already filled, you cannot influence its contents with your decision, so you may as well take both boxes and gain the extra $1000 from the transparent box. CDT formalizes this style of reasoning. However, one-boxers win in this scenario. After all, if you one-box then Omega (almost certainly) predicted that you would do so and hence filled the opaque box with $1,000,000. So you will almost certainly end up with $1,000,000 if you one-box. On the other hand, if you two-box, Omega (almost certainly) predicted this and so left the opaque box empty. So you will almost certainly end up with $1000 (from the transparent box) if you two-box. Consequently, if rationality is about winning then it’s rational to one-box in Newcomb’s problem (and hence CDT fails to be an adequate decision theory).
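The "one-boxers win" arithmetic can be checked directly. The sketch below computes the average payoff of each kind of agent as a function of Omega's accuracy. The accuracy parameter and the function name are assumptions introduced for illustration, since the problem statement only says that Omega is an almost perfect predictor. This is the payoff accounting used in the paragraph above, not the calculation a CDT agent would perform.

```python
SMALL = 1_000      # transparent box
BIG = 1_000_000    # opaque box, if filled


def expected_payoff(action: str, accuracy: float) -> float:
    """Average winnings of agents who take `action`, given predictor accuracy.

    `accuracy` is the assumed probability that Omega predicts the agent's
    actual choice correctly.
    """
    if action == "one-box":
        # The opaque box is full exactly when Omega predicted one-boxing.
        return accuracy * BIG
    if action == "two-box":
        # The opaque box is full only when Omega mispredicted.
        return SMALL + (1 - accuracy) * BIG
    raise ValueError(f"unknown action: {action!r}")


if __name__ == "__main__":
    for acc in (0.99, 0.9, 0.5005):
        print(acc,
              expected_payoff("one-box", acc),
              expected_payoff("two-box", acc))
```

With an assumed accuracy of 0.99, one-boxers average about $990,000 and two-boxers about $11,000. The two are equal at an accuracy of 0.5005, so under this accounting one-boxing comes out ahead for any predictor even slightly better than chance.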

TDT will endorse one-boxing in this scenario and hence endorses the winning decision. When Omega predicts your behavior, it carries out the same abstract computation as you do when you decide whether to one-box or two-box. To make this point clear, we can imagine that Omega makes this prediction by creating a simulation of you and observing its behavior in Newcomb’s problem. This simulation will clearly decide according to the same abstract computation as you do, since both you and it decide in the same manner. Now, given that TDT says to act as if deciding the output of this computation, it tells you to act as if your decision to one-box can determine the behavior of the simulation (or, more generally, Omega’s prediction) and hence the filling of the boxes. So TDT correctly endorses one-boxing in Newcomb’s problem as it tells the agent to act as if doing so will lead them to get $1,000,000 instead of $1000.
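The simulation picture can be sketched as a program in which Omega and the agent both call the same function. This is a toy illustration under the assumption of a perfect predictor; the names `newcomb` and `best_output` are hypothetical and do not come from the TDT paper.

```python
from typing import Callable

Policy = Callable[[], str]  # the abstract computation: returns "one-box" or "two-box"


def newcomb(policy: Policy) -> int:
    """Play Newcomb's problem against an Omega that simulates `policy`."""
    # Omega predicts by running the very computation the agent implements.
    prediction = policy()
    opaque = 1_000_000 if prediction == "one-box" else 0
    # Later the agent decides by running that same computation again.
    choice = policy()
    return opaque if choice == "one-box" else opaque + 1_000


def best_output(options=("one-box", "two-box")) -> str:
    """Choose as if setting the output of the shared computation.

    Each candidate output is evaluated in both places where the computation
    is instantiated: Omega's simulation and the agent's own choice.
    """
    return max(options, key=lambda out: newcomb(lambda: out))


if __name__ == "__main__":
    print(newcomb(lambda: "one-box"))
    print(newcomb(lambda: "two-box"))
    print(best_output())
```

Because the prediction and the choice come from one function, there is no run of this program in which the agent two-boxes while the opaque box is full. Choosing the function's output therefore selects between $1,000,000 and $1000, and `best_output` returns "one-box".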

TDT and other decision scenarios

TDT also wins in a range of other cases including medical Newcomb’s problems, Parfit’s hitchhiker, and the one-shot prisoners’ dilemma. However, there are other scenarios where TDT does not win, including counterfactual mugging. This suggests that TDT still requires further development if it is to become a fully adequate decision theory. Given this, there is some motivation to also consider alternative decision theories alongside TDT, like updateless decision theory (UDT), which also wins in a range of scenarios but has its own problem cases. It seems likely that both of these theories draw on insights which are crucial to progressing our understanding of decision theory. So while TDT requires further development to be entirely adequate, it nevertheless represents a substantial step toward developing a decision theory that always endorses the winning decision.

Formalization of TDT

Coming to fully grasp TDT requires an understanding of how the theory is formalized. Very briefly, TDT is formalized by supplementing causal Bayesian networks, which can be thought of as graphs representing causal relations, in two ways. First, these graphs should be supplemented with nodes representing abstract computations and an agent’s uncertainty about the result of these computations. Such a node might represent an agent’s uncertainty about the result of a mathematical sum. Second, TDT treats decisions as the abstract computation that underlies the agent’s decision process. These two features transform causal Bayesian networks into timeless decision diagrams. Using these supplemented diagrams, TDT is able to determine the winning decision in a whole range of decision scenarios. For a more detailed description of the formalization of TDT, see Eliezer Yudkowsky’s timeless decision theory paper.

Notable Posts

External Links

Timeless Decision Theory (2010) by Eliezer Yudkowsky
An Introduction to Timeless Decision Theory at Formalised Thinking

Timeless Decision Theory: Problems I Can’t Solve (Eliezer Yudkowsky, Jul 20, 2009)
A Paradox in Timeless Decision Theory (AlexMennen, Oct 25, 2010)
Decision Theories: A Semi-Formal Analysis, Part III (orthonormal, Apr 14, 2012)
Decision Theories: A Semi-Formal Analysis, Part II (orthonormal, Apr 6, 2012)
If you choose not to decide, you still have made a choice. (Zvi, Mar 24, 2017)
Timeless Causality (Eliezer Yudkowsky, May 29, 2008)
Decision Theory FAQ (lukeprog, Feb 28, 2013)
Timeless Control (Eliezer Yudkowsky, Jun 7, 2008)
Timelessness as a Conservative Extension of Causal Decision Theory ([deleted], May 28, 2014)
How I Lost 100 Pounds Using TDT (Zvi, Mar 14, 2011)
One Doubt About Timeless Decision Theories (Chris_Leong, Oct 22, 2018)
The Difference Between Classical, Evidential, and Timeless Decision Theories (DanielLC, Mar 26, 2011)
The absent-minded variations (dr_s, May 17, 2025)
Do Timeless Decision Theorists reject all blackmail from other Timeless Decision Theorists? (myren, Nov 11, 2022)
Timeless Identity (Eliezer Yudkowsky, Jun 3, 2008)
A problem with Timeless Decision Theory (TDT) (Gary_Drescher, Feb 4, 2010)
Ingredients of Timeless Decision Theory (Eliezer Yudkowsky, Aug 19, 2009)
Timeless Decision Theory and Meta-Circular Decision Theory (Eliezer Yudkowsky, Aug 20, 2009)
Discussion for Eliezer Yudkowsky’s paper: Timeless Decision Theory (Alexei, Jan 6, 2011)
For the Sake of Pleasure Alone (Greenless Mirror, Feb 27, 2025)
Anti-Parfit’s Hitchhiker (k64, Feb 4, 2022)
Newcomb’s paradox complete solution. (Augs SMSHacks, Mar 15, 2023)
Humans do acausal coordination all the time (Adam Jermyn, Nov 2, 2022)
FDT is not directly comparable to CDT and EDT (SMK, Sep 29, 2022)
Does Time Linearity Shape Human Self-Directed Evolution, and will AGI/ASI Transcend or Destabilise Reality? (Emmely, Feb 5, 2025)
Breaking Newcomb’s Problem with Non-Halting states (Slimepriestess, Sep 4, 2022)

yesaul 4 Jun 2021 20:37 UTC
1 point
“Ignorance is a state of mind, stored in neurons, not the environment. The red ball does not know that we are ignorant of it. A probability is a way of quantifying a state of mind. Our ignorance then obeys useful mathematical properties—Bayesian probability theory—allowing us to systematically reduce our ignorance through observation. How would you go about reducing ignorance if there were no way to measure ignorance? What, indeed, is the advantage of not quantifying our ignorance, once we understand that quantifying ignorance reflects a choice about how to think effectively, and not a physical property of red and white balls?”
I want to propose a short note for this priceless observation. Maybe I’m overreacting, and it’s not as significant as I see it. Apologies if that is so.
Your conjecture presupposes a unidirectional linear, absolute, and static structure of knowledge—a minimal perspective or not generally applicable. It seems as if you have forgotten about the phenomenon, “a fresh pair of eyes.” Literally meaning employing someone who has not gone the same road as you did (a person “more” ignorant of the problem at hand than you) to help you get out of your informational dead-end.
You have fallen into the same trap as the philosophers who believed that there is a formula to ultimate and absolute knowledge and ultimate state of mind, who prophesized that if only we find and follow this formula, everyone will attain eternal happiness. I’m personally skeptical about measuring the quality, quantity, and practical applicability of knowledge or ignorance. Let alone the questions about what really matters and their mutual interaction. But, unfortunately, your mode of thinking will most likely lead to the same premises and methods found in totalitarian regimes, and ultimately to inability to adapt and to intellectual stagnation. If I were to choose one argument against measuring knowledge, it would be that this will preclude the invention of the “ultimate” knowledge elixir and, as a result, will retain random factors in knowledge-seeking.
But to indulge your theory a little further, let me mention a few other predictions. For example, the fabric of knowledge is probably not linear or unidirectional, and it probably has local limits (dead-ends). And probably our perception of truths depends on time and our condition. And also, moving in one direction may increase ignorance in the opposite, so to speak. Of this, we have countless accounts.
When I think about epistemology, I sometimes remember Little Gidding. I think it has a very peculiar relationship with your discovery:
We shall not cease from exploration
And the end of all our exploring
Will be to arrive where we started
And know the place for the first time.
