jake_mendel

Karma: 266

Interpretability Researcher at Apollo Research

A starting point for making sense of task structure (in machine learning)

24 Feb 2024 1:51 UTC
37 points
2 comments · 12 min read · LW link

Toward A Mathematical Framework for Computation in Superposition

18 Jan 2024 21:06 UTC
182 points
16 comments · 73 min read · LW link