“FDT outputs X” is importantly underspecified. If you just change what it says in your exact situation, then you have no account of how it changes the other algorithms like the simulator.
The bigger problem though isn’t the counterlogicals but analyzing how in the counterlogical world other algorithms would be different.
Newcomb’s Revenge actually is a fair problem which is not quite as much of a problem as “If you use FDT, I will kill you”. But if you accept Newcomb’s Revenge as symmetric to Newcomb’s problem then you have to also deny that any decision theory can be better than any other. For any decision problem where we simulate an agent in situation X, decide a payoff matrix based on X for a different instance which is in a different situation Y, then you can come up with a flip. Newcomb’s problem X is important because it depends on your behaviour in the situation that you are actually in! Which breaks the asymmetry.
And ok if you actually want to think about this in depth it gets super cursed super quickly, because any FDT agent will have to have a prior over being in a Newcomblike problem vs a Revengelike problem and if it turns out that in the real world, Newcomb problems are rare and Revenge problems are common, then FDT will update accordingly and start one-boxing, but actually the concept of high-fidelity simulation at some level itself breaks down because the simulator can induce any behaviour in your simulated self by giving you arbitrary inputs.
Newcomb’s problem also works if you treat the simluated agents as agents themselves who have an accurate prior over their own situation (the maths takes a while to shake out but it definitely does). While Revenge only works in this way if the simulated agents think they’re being simulated by a Newcomblike Omega (otherwise, if they know they’re 50:50 real or being simulated by a Revengelike Omega, they will two box).
I don’t understand your claim on how exactly the counterlogicals break. Normally the obstacle is Lob’s theorem, which IIUC Logical Induction fixes, but if you have a stronger argument then I would like to hear it.
Why is Newcomb’s any less fair than Newcomb’s revenge? https://www.umsu.de/blog/2022/772
“FDT outputs X” is importantly underspecified. If you just change what it says in your exact situation, then you have no account of how it changes the other algorithms like the simulator.
The bigger problem though isn’t the counterlogicals but analyzing how in the counterlogical world other algorithms would be different.
Newcomb’s Revenge actually is a fair problem which is not quite as much of a problem as “If you use FDT, I will kill you”. But if you accept Newcomb’s Revenge as symmetric to Newcomb’s problem then you have to also deny that any decision theory can be better than any other. For any decision problem where we simulate an agent in situation X, decide a payoff matrix based on X for a different instance which is in a different situation Y, then you can come up with a flip. Newcomb’s problem X is important because it depends on your behaviour in the situation that you are actually in! Which breaks the asymmetry.
And ok if you actually want to think about this in depth it gets super cursed super quickly, because any FDT agent will have to have a prior over being in a Newcomblike problem vs a Revengelike problem and if it turns out that in the real world, Newcomb problems are rare and Revenge problems are common, then FDT will update accordingly and start one-boxing, but actually the concept of high-fidelity simulation at some level itself breaks down because the simulator can induce any behaviour in your simulated self by giving you arbitrary inputs.
Newcomb’s problem also works if you treat the simluated agents as agents themselves who have an accurate prior over their own situation (the maths takes a while to shake out but it definitely does). While Revenge only works in this way if the simulated agents think they’re being simulated by a Newcomblike Omega (otherwise, if they know they’re 50:50 real or being simulated by a Revengelike Omega, they will two box).
I don’t understand your claim on how exactly the counterlogicals break. Normally the obstacle is Lob’s theorem, which IIUC Logical Induction fixes, but if you have a stronger argument then I would like to hear it.