Tao Lin comments on Generating the Funniest Joke with RL (according to GPT-4.1)

Tao Lin 17 May 2025 4:51 UTC
11 points
6
Gpt-4.1 is an expecially soulless model. It’s intended for API use only, whereas chatgpt-latest is meant to chat with humans. It’s not as bad as o1-mini—that model is extremely autistic and has no concept of emotion. This would work much better with ~pretrained models. Likely you can get gpt-4-base or llama 405b base to do much better with just prompting and no RL.
- cubefox 17 May 2025 7:10 UTC
  3 points
  0
  Parent
  Or DeepSeek-V3-Base.