RSS

ntt123

Karma: 15

Logit Prisms: De­com­pos­ing Trans­former Out­puts for Mechanis­tic Interpretability

ntt12317 Jun 2024 11:46 UTC
5 points
4 comments6 min readLW link
(neuralblog.github.io)

Ex­plor­ing Llama-3-8B MLP Neurons

ntt1239 Jun 2024 14:19 UTC
10 points
0 comments4 min readLW link
(neuralblog.github.io)