Daniel Kokotajlo comments on Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense

Daniel Kokotajlo 27 Nov 2023 14:10 UTC
LW: 26 AF: 15
The thing people seem to be disagreeing about is the thing you haven’t operationalized—the “and it’ll still be basically as tool-like as GPT4” bit. What does that mean and how do we measure it?