Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Tom McGrath
Karma:
108
All
Posts
Comments
New
Top
Old
[Linkpost] Play with SAEs on Llama 3
Tom McGrath
,
Eric Ho
and
Dan Balsam
25 Sep 2024 22:35 UTC
41
points
2
comments
1
min read
LW
link
[Paper] All’s Fair In Love And Love: Copy Suppression in GPT-2 Small
CallumMcDougall
,
Arthur Conmy
,
Cody Rushing
,
Tom McGrath
and
Neel Nanda
13 Oct 2023 18:32 UTC
82
points
4
comments
8
min read
LW
link
Back to top