I didn’t see it explicitly mentioned till now whether this method is supposed to work for superhuman AI or just early TAI.
Additionally, multi-step plans allow the agent to use early actions to enter states that are very different from any states that humans have ever experienced.
Wouldn’t a benign superhuman agent routinely enter such states? We would expect it to make complex plans that are superhuman and these trajectories should be very different from what a human could conceive of.
I didn’t see it explicitly mentioned till now whether this method is supposed to work for superhuman AI or just early TAI.
Wouldn’t a benign superhuman agent routinely enter such states?
We would expect it to make complex plans that are superhuman and these trajectories should be very different from what a human could conceive of.