ryan_greenblatt comments on ryan_greenblatt’s Shortform

ryan_greenblatt 15 May 2026 15:07 UTC
2 points
No. But it seems tractable to use this methodology to build better tests of whether AIs would take intermediate bad actions.