RSS

Clément Dumas

Karma: 400

Mech interp researcher working with Neel Nanda and Julian Minder on model diffing as part of the MATS 7 extension.

https://​​butanium.github.io/​​

Clé­ment Du­mas’s Shortform

Clément Dumas14 Mar 2026 13:06 UTC
4 points
2 comments1 min readLW link

Ac­ti­va­tion Or­a­cles: Train­ing and Eval­u­at­ing LLMs as Gen­eral-Pur­pose Ac­ti­va­tion Explainers

18 Dec 2025 20:21 UTC
153 points
11 comments8 min readLW link
(arxiv.org)