To clarify, I don’t think that LLM agents are necessarily or obviously safe. I was just trying to argue that it’s plausible they could achieve long-term objectives while not having “wanting” in the sense necessary for (some) AI risk arguments to go through. (Edited the earlier comment to make this clearer.)
Thanks for the clarification!