If you tell them how to reason, they usually just throw those suggestions out and reason the way RL taught them to (and sometimes OpenAI also threatens to ban you for trying).
Upvoted, but also I’m curious about this:
Can you elaborate on the parenthetical part?
The ban-threat thing? I’m talking about this, which is reportedly still in effect. Any attempt to extract information about reasoning models’ CoTs, or sometimes even just to influence them, might trigger it.