I agree that most of my concern has moved to inner (and, in particular, deceptive) alignment. I still don’t quite see how to get enough outer alignment to trust an AI with the future lightcone, but I am much less worried about it.
I agree that most of my concern has moved to inner (and, in particular, deceptive) alignment. I still don’t quite see how to get enough outer alignment to trust an AI with the future lightcone, but I am much less worried about it.