Tomek Korbak

Karma: 1,149

I work on monitoring agents at OpenAI.

https://tomekkorbak.com/

Reasoning Models Struggle to Control Their Chains of Thought

5 Mar 2026 22:37 UTC
76 points
9 comments · 3 min read · LW link

Training Agents to Self-Report Misbehavior

25 Feb 2026 17:50 UTC
26 points
0 comments · 8 min read · LW link

Lessons from Studying Two-Hop Latent Reasoning

11 Sep 2025 17:53 UTC
68 points
19 comments · 2 min read · LW link
(arxiv.org)