So can we show that TDT would play perfectly in such a scenario? I think yes.
Your decisions interact in complex ways with your peer’s decisions, your descendant’s decisions, etc. There might be a nice simple formula that classifies this, but:
These are TDT agents. So they all behave the same. At least, the ones during the same timestep behave the same. The ones during different time steps are in different situations, but have the same values. Since they (overall) have he same values as past generations, they will together implement a coherent strategy.
So can we show that TDT would play perfectly in such a scenario? I think yes.
Your decisions interact in complex ways with your peer’s decisions, your descendant’s decisions, etc. There might be a nice simple formula that classifies this, but:
These are TDT agents. So they all behave the same. At least, the ones during the same timestep behave the same. The ones during different time steps are in different situations, but have the same values. Since they (overall) have he same values as past generations, they will together implement a coherent strategy.