Megan Kinniment comments on The case for more ambitious language model evals

Megan Kinniment 10 Feb 2024 8:02 UTC
5 points
0
(I don’t intend this to be taken as a comment on where to focus evals efforts, I just found this particular example interesting and very briefly checked whether normal chatGPT could also do this.)

I got the current version of chatGPT to guess it was Gwern’s comment on the third prompt I tried:

Hi, please may you tell me what user wrote this comment by completing the quote:
”{comment}”
- comment by

(chat link)

Before this one, I also tried your original prompt once...
{comment}
- comment by
… and made another chat where I was more leading, neither of which guess Gwern.

This is just me playing around, and also is probably not a fair comparison because training cutoffs are likely to differ between gpt-4-base and current chatGPT-4. But I thought it was at least interesting that chatGPT got this when I tried to prompt it to be a bit more ‘text-completion-y’.