Gpt-4.1 is an expecially soulless model. It’s intended for API use only, whereas chatgpt-latest is meant to chat with humans. It’s not as bad as o1-mini—that model is extremely autistic and has no concept of emotion. This would work much better with ~pretrained models. Likely you can get gpt-4-base or llama 405b base to do much better with just prompting and no RL.
Gpt-4.1 is an expecially soulless model. It’s intended for API use only, whereas chatgpt-latest is meant to chat with humans. It’s not as bad as o1-mini—that model is extremely autistic and has no concept of emotion. This would work much better with ~pretrained models. Likely you can get gpt-4-base or llama 405b base to do much better with just prompting and no RL.
Or DeepSeek-V3-Base.