jacob_drori

Karma: 565

jacob_drori’s Shortform

jacob_drori · 1 Aug 2025 17:47 UTC
7 points
6 comments · 1 min read · LW link

[Research Note] Optimizing The Final Output Can Obfuscate CoT

30 Jul 2025 21:26 UTC
196 points
22 comments · 6 min read · LW link

SAE on activation differences

30 Jun 2025 17:50 UTC
44 points
3 comments · 5 min read · LW link

Sparsely-connected Cross-layer Transcoders

jacob_drori · 18 Jun 2025 17:13 UTC
45 points
3 comments · 12 min read · LW link

There is a globe in your LLM

jacob_drori · 8 Oct 2024 0:43 UTC
89 points
4 comments · 1 min read · LW link

Domain-specific SAEs

jacob_drori · 7 Oct 2024 20:15 UTC
28 points
2 comments · 5 min read · LW link

Open Source Automated Interpretability for Sparse Autoencoder Features

30 Jul 2024 21:11 UTC
67 points
1 comment · 13 min read · LW link
(blog.eleuther.ai)