Vladimir_Nesov comments on My take on Vanessa Kosoy’s take on AGI safety

Vladimir_Nesov 2 Oct 2021 20:40 UTC
LW: 4 AF: 1
0
AF
I’d say alignment should be about values, so only your “even better alignment” qualifies. The non-agentic AI safety concepts like corrigibility, that might pave the way to aligned systems if controllers manage to keep their values throughout the process, are not themselves examples of alignment.