What app were you using? This sounds very similar to my experience using GPT-5 in Cursor.Codex CLI is much much better—night and day difference.I suppose this is good evidence that harness-specific RL was important for GPT-5.
What app were you using? This sounds very similar to my experience using GPT-5 in Cursor.
Codex CLI is much much better—night and day difference.
I suppose this is good evidence that harness-specific RL was important for GPT-5.