In that case, can you imagine an AGI that, given that it can't attack and kill all humans (doing so would be unwise), is coerced into giving a human-readable solution to the alignment problem? If not, why not?