I’d like AGIs to have a big red shutdown button that is used/tested regularly, so we know that the AI will shut down and won’t try to interfere. I’m not saying this is sufficient to prove that the AI is safe, just that I would sleep better at night knowing that stop-button corrigibility is solved.