Glad to see a response of this nature, actually. The first thing I thought when I read this post was that a good response to Eliezer’s question would be extremely relevant to the AI-box quandary. If we trust the AI more than ourselves, then voilà, the AI can convince us to let it out of the box.