RSS

Yueh Han "John" Chen

Karma: 97

Safety Research at NYU. MATS 8.0. Prev: UC Berkeley.

www.john-chen.cc

Rea­son­ing Models Strug­gle to Con­trol Their Chains of Thought

5 Mar 2026 22:37 UTC
76 points
9 comments3 min readLW link

Train­ing Agents to Self-Re­port Misbehavior

25 Feb 2026 17:50 UTC
26 points
0 comments8 min readLW link