Not gonna weigh in on the object level but on the meta level I think we’re reaching the point where existing concepts like “corrigibility” and “human morality” are starting to buckle, and we need a better ontology in order to have more productive discussions about this.
Not gonna weigh in on the object level but on the meta level I think we’re reaching the point where existing concepts like “corrigibility” and “human morality” are starting to buckle, and we need a better ontology in order to have more productive discussions about this.
Huh, that seems totally wrong to me. This seems like about as straightforwardly a case of incorrigibility as I can imagine.
Step 1, Solve ethics and morality.
Step 2. Build stronger AI without losing the lightcone or going extinct.
Step 3. Profit.