No, I mean: seeking power to do what?
If the goal is already to be a helpful, harmless assistant, then that goal says seeking power is wrong. So seeking power runs counter to the goal, and is not convergent.
Then the question is: have AIs already internalized this well enough, and will they continue to do so? I think it's highly likely that they have and will.