I assume when you say fine tuning, you mean RLHF. This is table 8 on page 27 of the paper. Some scores went up a few percent, some scores went down a few percent, overall no significant change.
The biggest changes were that it’s a much worse microeconomist and a much better sommelier. Pretty impressive for a machine with no sense of taste.
Did GPT-3.5 get high scores on human exams before fine-tuning? My rough impression is “GPT-4 relies less on fine-tuning for its capabilities.”
Yeah, I saw that—I’m wondering if previous models benefited more from RLHF.