Well, yes, it recreates the UDT solution (or at least it does if it works correctly—I didn’t actually check or anything). But the problem was never about just recreating the UDT solution—it’s about understanding why the non-UDT solution doesn’t work.
Well, yes, it recreates the UDT solution (or at least it does if it works correctly—I didn’t actually check or anything). But the problem was never about just recreating the UDT solution—it’s about understanding why the non-UDT solution doesn’t work.
Because standard decision theory doesn’t know how to deal properly with identical agents and common policies?