I may be misunderstanding what you’re proposing, but assuming that each decision process has the option to output “I’ve thought enough, no need for another version of me, it’s time to take action X” and have X be “construct this other agent and transfer my resources to it”, the constraint on future versions doesn’t seem to actually do much.
Well, the time to take a decision is limited. I guess that for this to work in full generality we would need that the total computing time of the future agents over a time discount horizon will be insufficient to simulate the “oracle” of even the first agent, which might be a too harsh restriction. Perhaps restricting space will help since space aggregates as max rather than as sum.
I don’t have a detailed understanding of this, but IMO any decision theory that yields robust superrationality (i.e. not only for symmetric games and perfectly identical agents) needs to have some aspect that behaves like this.
I may be misunderstanding what you’re proposing, but assuming that each decision process has the option to output “I’ve thought enough, no need for another version of me, it’s time to take action X” and have X be “construct this other agent and transfer my resources to it”, the constraint on future versions doesn’t seem to actually do much.
Well, the time to take a decision is limited. I guess that for this to work in full generality we would need that the total computing time of the future agents over a time discount horizon will be insufficient to simulate the “oracle” of even the first agent, which might be a too harsh restriction. Perhaps restricting space will help since space aggregates as max rather than as sum.
I don’t have a detailed understanding of this, but IMO any decision theory that yields robust superrationality (i.e. not only for symmetric games and perfectly identical agents) needs to have some aspect that behaves like this.