For the record, I think this blog post was mostly aimed at frontier labs pushing this plan; the situation is different for independent orgs. I think there is useful work to be done on subproblems of AI-assisted alignment, such as interpretability. So I agree that there is prosaic alignment work that can be done, though I am probably still much less optimistic about it than you are.