RSS

Logan Riggs

Karma: 4,041

When Are Two Net­works the Same? Ten­sor Similar­ity for Mechanis­tic Interpretability

29 May 2026 15:53 UTC
36 points
3 comments4 min readLW link

Black Boxes for Low-Stakes, In­ter­pretable AI for High-Stakes

Logan Riggs28 May 2026 15:34 UTC
18 points
0 comments2 min readLW link

Write Cause You Have Some­thing to Say

Logan Riggs8 May 2026 13:36 UTC
37 points
5 comments2 min readLW link

Am­bi­tious Mech In­terp w/​ Ten­sor-trans­form­ers on toy lan­guages [Pro­ject Pro­posal]

Logan Riggs1 May 2026 19:17 UTC
21 points
0 comments2 min readLW link

Con­sent-Based RL: Let­ting Models En­dorse Their Own Train­ing Updates

Logan Riggs17 Apr 2026 13:53 UTC
76 points
6 comments3 min readLW link

Mass Surveillance w/​ LLMs is the De­fault Out­come. Con­tracts Won’t Change That.

Logan Riggs3 Mar 2026 21:18 UTC
43 points
1 comment2 min readLW link

How to Reset

Logan Riggs18 Feb 2026 19:49 UTC
10 points
2 comments2 min readLW link