RSS

HanneWhitt

Karma: 64

AI Safety Technical Research Manager at Meridian Research, Cambridge UK. Background in AI Control (MARS, LASR)

Un­faith­ful Rea­son­ing Can Fool Chain-of-Thought Monitoring

2 Jun 2025 19:08 UTC
78 points
17 comments3 min readLW link