How does “Safety-II” compare with Eliezer’s description of security mindset? On the surface they sound very similar, and I would expect highly reliable organizations to value a security mindset in some form.
I don’t recall how Eliezer thinks of security mindset, but in its original context it’s more about thinking like an adversary and designing things knowing that they will be subject to attack from multiple angles and that you might fail to anticipate all angles of attack so you better be ready for the unexpected.
How does “Safety-II” compare with Eliezer’s description of security mindset? On the surface they sound very similar, and I would expect highly reliable organizations to value a security mindset in some form.
I don’t recall how Eliezer thinks of security mindset, but in its original context it’s more about thinking like an adversary and designing things knowing that they will be subject to attack from multiple angles and that you might fail to anticipate all angles of attack so you better be ready for the unexpected.