RSS

Yogesh Prabhu

Karma: 22

obsessed with interp

Can You Hide From a Nat­u­ral Lan­guage Au­toen­coder?

Yogesh Prabhu24 Jun 2026 2:41 UTC
11 points
2 comments7 min readLW link
(yogesh.bearblog.dev)

The Dual-Use Gap

Yogesh Prabhu14 Jun 2026 17:43 UTC
5 points
2 comments4 min readLW link
(yogesh.bearblog.dev)

The Prag­matic In­ter­pretabil­ity Trap

Yogesh Prabhu11 May 2026 4:06 UTC
7 points
0 comments3 min readLW link
(yogesh.bearblog.dev)