Vivek Hebbar comments on Operationalizing FDT

Vivek Hebbar 14 Mar 2026 5:01 UTC
I did say “suppose you are deterministic”. That said, can you spell out how CDT ratifies the optimal policy if randomization is allowed?
- Lukas Finnveden 14 Mar 2026 7:25 UTC
  I believe it follows from this proof: https://www.alignmentforum.org/posts/5bd75cc58225bf06703751b2/in-memoryless-cartesian-environments-every-udt-policy-is-a