ryan_greenblatt comments on Alignment first, intelligence later

ryan_greenblatt 8 May 2025 1:30 UTC
3 points
I agree about reflexive endorsement being important, at least eventually, but don’t think this is out of reach while still having robust spec compliance and corrigibility.^[1]

Probably not worth getting into the overall argument, but thanks for the reply.
1. Humans often endorse complex or myopic drives on reflection! This isn’t something which is totally out of reach.