I think, if you had several UDT agents with the same source code, and then one UDT agent with slightly different source code, you might see the unique agent defect.
I think the CDT agent has an advantage here because it is capable of making distinct decisions from the rest of the population—not because it is CDT.
The general hope is that slight differences in source code (or even large differences, as long as they’re all using UDT or something close to it) wouldn’t be enough to make a UDT agents defect against another UDT agent (i.e. the logical correlation between their decisions would be high enough), otherwise “UDT agents cooperate with each other in one-shot PD” would be false or not have much practical implications, since why would all UDT agents have the exact same source code?
There are at least two potential sources of cooperation: symmetry and mutual source code knowledge; symmetry should be fragile to small changes in source code (I expect) as well as asymmetry between the situations of the different parties while mutual source code knowledge doesn’t require those sorts of symmetry at all (but does require knowledge).
Edit: for some reason my intuition expects cooperation from similarity to be less fragile in the Newcomb’s problem/code knowledge case (similarity to simulation) than if the similarity is just plain similarity to another, non-simulation agent. I need to think about why and if this has any connection to what would actually happen.
I think, if you had several UDT agents with the same source code, and then one UDT agent with slightly different source code, you might see the unique agent defect.
I think the CDT agent has an advantage here because it is capable of making distinct decisions from the rest of the population—not because it is CDT.
The general hope is that slight differences in source code (or even large differences, as long as they’re all using UDT or something close to it) wouldn’t be enough to make a UDT agents defect against another UDT agent (i.e. the logical correlation between their decisions would be high enough), otherwise “UDT agents cooperate with each other in one-shot PD” would be false or not have much practical implications, since why would all UDT agents have the exact same source code?
There are at least two potential sources of cooperation: symmetry and mutual source code knowledge; symmetry should be fragile to small changes in source code (I expect) as well as asymmetry between the situations of the different parties while mutual source code knowledge doesn’t require those sorts of symmetry at all (but does require knowledge).
Edit: for some reason my intuition expects cooperation from similarity to be less fragile in the Newcomb’s problem/code knowledge case (similarity to simulation) than if the similarity is just plain similarity to another, non-simulation agent. I need to think about why and if this has any connection to what would actually happen.
I mean, that’s a thing you might hope to be true. I’m not sure if it actually is true.