Rohin Shah

Karma: 13,631

Research Scientist at DeepMind. Creator of the Alignment Newsletter. http://rohinshah.com/

Explaining grokking through circuit efficiency

8 Sep 2023 14:39 UTC
95 points
8 comments, 3 min read, LW link
(arxiv.org)

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla

20 Jul 2023 10:50 UTC
43 points
3 comments, 2 min read, LW link
(arxiv.org)