RSS

Hoagy

Karma: 1,121

Sparse Au­toen­coders Find Highly In­ter­pretable Direc­tions in Lan­guage Models

Sep 21, 2023, 3:30 PM
159 points

61 votes

Overall karma indicates overall quality.

8 comments5 min readLW link