RSS

Adam Karvonen

Karma: 1,323

Real­is­tic Eval­u­a­tions Will Not Prevent Eval­u­a­tion Awareness

Adam Karvonen24 Feb 2026 17:51 UTC
37 points
9 comments6 min readLW link

Ac­ti­va­tion Or­a­cles: Train­ing and Eval­u­at­ing LLMs as Gen­eral-Pur­pose Ac­ti­va­tion Explainers

18 Dec 2025 20:21 UTC
154 points
11 comments8 min readLW link
(arxiv.org)