Interesting new paper that examines this question:
When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors (see §5)
Interesting new paper that examines this question:
When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors (see §5)