Naive decision theory: “Choose the action that will cause the highest expected utility, given what I know now.”
Timeless decision theory: “Choose the action that I wish I had precommitted to, given what I know now.”
Updateless decision theory: “Choose a set of rules that will cause the highest expected utility given my priors, then stick to it no matter what happens.”
There’s nothing that prevents a UDT agent from behaving as if it were updating; that’s what I surmise would happen in more normal situations where Omega isn’t involved. But if ignoring information is the winning move, TDT can’t do that.
This is my vague understanding.
Naive decision theory: “Choose the action that will cause the highest expected utility, given what I know now.”
Timeless decision theory: “Choose the action that I wish I had precommitted to, given what I know now.”
Updateless decision theory: “Choose a set of rules that will cause the highest expected utility given my priors, then stick to it no matter what happens.”
If this is accurate, then I don’t see how UDT can generally be better than TDT.
UDT would be better in circumstances where you suspect that your ability to update accurately is compromised.
I’m assuming that the priors for UDT were set at some past time.
UDT gives the money in the counterfactual mugging thought experiment, TDT doesn’t.
There’s nothing that prevents a UDT agent from behaving as if it were updating; that’s what I surmise would happen in more normal situations where Omega isn’t involved. But if ignoring information is the winning move, TDT can’t do that.