Just re-read this because you cited it recently, and I like it even more the second time :)
I also like an intermediate point between the changes you lay out: keeping the “old style” tree diagram that puts outer alignment and objective robustness together under “intent alignment,” but changing the interpretation of these boxes to your “new style” version where outer alignment is less impressive / stringent and robustness is more central.
Just re-read this because you cited it recently, and I like it even more the second time :)
I also like an intermediate point between the changes you lay out: keeping the “old style” tree diagram that puts outer alignment and objective robustness together under “intent alignment,” but changing the interpretation of these boxes to your “new style” version where outer alignment is less impressive / stringent and robustness is more central.