ryan_greenblatt comments on Jan Betley’s Shortform

ryan_greenblatt 12 Jun 2025 5:21 UTC
4 points
2
Importantly, I think we have a good argument (which might convince the AI) for why this would be a good policy in this case.

I’ll engage with the rest of this when I write my pro-strong-corrigibility manifesto.