The thing people seem to be disagreeing about is the thing you haven’t operationalized—the “and it’ll still be basically as tool-like as GPT4” bit. What does that mean and how do we measure it?