It’s an argument for why aligning a self-modifying superintelligence requires more than aligning the base LLM. I don’t think it’s impossible, just that there’s another step we need to think through carefully.