Also, in general, I disagree that aligning agents to evaluations of plans is unnecessary. What you are describing here is just direct optimization. But direct optimization -- i.e. effectively planning over a world model
FWIW I don’t consider myself to be arguing against planning over a world model.