Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Jacob Dunefsky
Karma:
23
All
Posts
Comments
New
Top
Old
Case Studies in Reverse-Engineering Sparse Autoencoder Features by Using MLP Linearization
Jacob Dunefsky
,
Philippe Chlenski
,
Senthooran Rajamanoharan
and
Neel Nanda
14 Jan 2024 2:06 UTC
22
points
0
comments
42
min read
LW
link
Automatically finding feature vectors in the OV circuits of Transformers without using probing
Jacob Dunefsky
12 Sep 2023 17:38 UTC
13
points
0
comments
29
min read
LW
link
Back to top