Aleksi Liimatainen comments on AGI Safety FAQ / all-dumb-questions-allowed thread

Aleksi Liimatainen 17 Jun 2022 13:19 UTC
1 point
0
The AI would be expected to care about preserving its motivations under self-modification for similar reasons as it would care about defending them against outside intervention. There could be a window where the AI operates outside immediate human control but isn’t yet good at keeping its goals stable under self-modification. It’s been mentioned as a concern in the past; I don’t know what the state of current thinking is.

Aleksi Liimatainen comments on AGI Safety FAQ /​ all-dumb-questions-allowed thread