Yair Halberstadt comments on Reasoning models don’t always say what they think

Yair Halberstadt 10 Apr 2025 4:49 UTC
3 points
0
Interesting!
I wonder what results you get for Gemini 2.5 pro. It’s COT seems much more structured than other thinking models and I wonder if that increases or decreases the chance it’ll mention the hint.