I agree about reflexive endorsement being important, at least eventually, but don’t think this is out of reach while still having robust spec compliance and corrigibility.[1]
Probably not worth getting into the overall argument, but thanks for the reply.
I agree about reflexive endorsement being important, at least eventually, but don’t think this is out of reach while still having robust spec compliance and corrigibility.[1]
Probably not worth getting into the overall argument, but thanks for the reply.
Humans often endorse complex or myopic drives on reflection! This isn’t something which is totally out of reach.