“any other agent who is not running TDT … will be able to re-construct the chain of logic and reason that the simulation one-boxed and so box B contains the $1 million”
For problem 1, in the language of the blackmail posts: because the tactic Omega uses to fill box 2 depends on TDT-sim’s decision, because Omega has already decided, and because Omega didn’t make its decision known, a TDT agent presented with this problem is at an epistemic disadvantage relative to Omega. TDT can’t react to Omega’s actual decision, because it won’t know Omega’s actual decision until it knows its own actual decision, at which point it can’t further react. This epistemic disadvantage doesn’t need to be enforced temporally: even if TDT knows Omega’s source code, an agent with limited simulation resources may have no practical way to compute Omega’s actual decision except via Omega’s dependence on TDT’s decision.
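A minimal sketch of that circularity, with the modelling and all names my own assumptions: an agent that tries to observe Omega’s actual decision before deciding recurses forever, while a TDT-style agent only reaches a decision by evaluating candidate outputs through Omega’s dependence on them.

```python
# Hypothetical rendering of problem 1: Omega's choice is a function of the
# simulated agent's choice, so the only route to Omega's decision runs
# through the agent's own decision.

def omega_fills_box_b(agent):
    """Omega puts $1M in box B iff its simulation of `agent` one-boxes."""
    return agent() == "one-box"

def naive_agent():
    """Tries to observe Omega's actual decision before deciding. Computing
    that decision means running Omega, which runs this agent again: the
    lookup never bottoms out (RecursionError if called)."""
    box_b_full = omega_fills_box_b(naive_agent)
    return "one-box" if box_b_full else "two-box"

def tdt_style_agent():
    """Instead of reacting to Omega's decision, evaluates each candidate
    output via Omega's dependence on that output, then picks the best."""
    payoffs = {}
    for choice in ("one-box", "two-box"):
        box_b = 1_000_000 if omega_fills_box_b(lambda c=choice: c) else 0
        payoffs[choice] = box_b + (1_000 if choice == "two-box" else 0)
    return max(payoffs, key=payoffs.get)

print(tdt_style_agent())  # "one-box": $1,000,000 beats $1,000
```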
Aren’t there ways other than running TDT for an agent to be at an epistemic disadvantage relative to Omega in this problem? Could you construct an agent which was itself disadvantaged relative to TDT?
“Take only the box with $1000.”
Which itself is inferior to “Take no box.”
Oh, neat. Agents in “lowest terms”, whose definitions don’t refer to other agents, can’t react to any agent’s decision, so they’re all at an epistemic disadvantage relative to each other, and to themselves, and to every other agent across all games.
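A small sketch of what “lowest terms” might look like if agents are rendered as programs (the rendering and names are my own): a lowest-terms agent’s definition mentions no other agent’s decision, while a reactive agent’s definition does.

```python
# Hypothetical contrast: "lowest terms" vs. a definition that refers to
# another agent's decision.

def lowest_terms_agent():
    # A constant; its source refers to no other agent, so no one's
    # decision can ever vary its output.
    return "one-box"

def reactive_agent(omega_decision):
    # NOT in lowest terms: its definition refers to Omega's decision,
    # so it can grab both boxes whenever it learns box B is full.
    return "two-box" if omega_decision == "box B filled" else "one-box"
```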
How is agent “Take only the box with $1000” epistemically inferior to agent “Take no box”? They’re both in “lowest terms”, in the sense that their definitions make no reference to other agents, or to other facts whose values depend on how environments depend on their values, so they’re functionally incapable of reacting to other agents’ decisions and are on equivalent footing.
How is agent “Take only the box with $1000” epistemically inferior to agent “Take no box”? They’re both constant decisions across all games, both functionally incapable of reacting to any other agent’s actual decisions. Even if we broaden the definition of “react” so that constant programs are reacting to other constant programs, your two agents still have equivalent conditional power / epistemic vantage / reactive footing.
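To make the equivalence concrete, here is a sketch under the same agents-as-programs assumption (names mine): viewed as functions of the other player’s decision, both named agents are constant, so no input ever changes either one’s output.

```python
# Sketch: both agents, viewed as functions of the other agent's decision,
# are constant functions, hence have identical reactive footing.

def take_only_the_1000_box(other_decision):
    return "box A only"   # same output for every input

def take_no_box(other_decision):
    return "no box"       # same output for every input

inputs = ("one-box", "two-box", "box A only", "no box")
# Each agent produces exactly one output across all possible inputs:
assert len({take_only_the_1000_box(d) for d in inputs}) == 1
assert len({take_no_box(d) for d in inputs}) == 1
```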