Santiago Aranguri

Karma: 60

Reproducing steering against evaluation awareness in a large open-weight model

10 Apr 2026 10:45 UTC
76 points
12 comments · 15 min read

SAE on activation differences

30 Jun 2025 17:50 UTC
45 points
3 comments · 5 min read

Tied Crosscoders: Explaining Chat Behavior from Base Model

22 Mar 2025 18:07 UTC
9 points
0 comments · 12 min read