This is the the basic dichotomy: How do you make an AI that modifies itself, but only in ways that don’t make it hurt you? This is WHY we talk about hard-coding in moral codes
(Correct) hardcoding is one answer, corrigibility another, reflective self correction another....
(Correct) hardcoding is one answer, corrigibility another, reflective self correction another....