Shubhorup Biswas comments on MONA: Managed Myopia with Approval Feedback

Shubhorup Biswas 20 Jun 2025 15:21 UTC
1 point
0
I didn’t see it explicitly mentioned till now whether this method is supposed to work for superhuman AI or just early TAI.
Additionally, multi-step plans allow the agent to use early actions to enter states that are very different from any states that humans have ever experienced.
Wouldn’t a benign superhuman agent routinely enter such states?
We would expect it to make complex plans that are superhuman and these trajectories should be very different from what a human could conceive of.