Correct. However, the method I proposed does not involve redefining one’s utility function, as it leaves terminal values unchanged. It simply recognizes that certain methods of achieving one’s pre-existing terminal values are better than others, which leaves the utility function unaffected (it only alters instrumental values).
The method I proposed is similar to pre-commitment for a causal decision theorist on a Newcomb-like problem. For such an agent, “locking out” future decisions can improve expected utility without altering terminal values. Likewise, a decision theory that fully absorbs such outcome-improving “lockouts” so that it outputs the same actions without explicit pre-commitment can increase its expected utility for the same utility function.
Correct. However, the method I proposed does not involve redefining one’s utility function, as it leaves terminal values unchanged. It simply recognizes that certain methods of achieving one’s pre-existing terminal values are better than others, which leaves the utility function unaffected (it only alters instrumental values).
The method I proposed is similar to pre-commitment for a causal decision theorist on a Newcomb-like problem. For such an agent, “locking out” future decisions can improve expected utility without altering terminal values. Likewise, a decision theory that fully absorbs such outcome-improving “lockouts” so that it outputs the same actions without explicit pre-commitment can increase its expected utility for the same utility function.