RSS

keshavs

Karma: 34

In­tro­spec­tion Adapters: Train­ing LLMs to Re­port Their Learned Behaviors

28 Apr 2026 19:02 UTC
41 points
1 comment12 min readLW link
(alignment.anthropic.com)