RSS

Aidan Ewart

Karma: 192

Undergraduate student studying Mathematics @ University of Bristol.

Interested in & persuing a career in technical AI safety.

Sparse Au­toen­coders: Fu­ture Work

21 Sep 2023 15:30 UTC
34 points
5 comments6 min readLW link

Sparse Au­toen­coders Find Highly In­ter­pretable Direc­tions in Lan­guage Models

21 Sep 2023 15:30 UTC
156 points
7 comments5 min readLW link