Consider a little superintelligent child who always wants to eat as much candy as possible over the course of the next ten minutes. Assume the child never cares about anything that happens beyond those ten minutes.
This child won’t work very hard at any instrumental goals like self improvement and conquering the world to redirect resources towards candy production, since that would be a waste of time, even though it might maximize candy consumption in the long term.
AI alignment isn’t any easier here, the point of this is just to illustrate that instrumental convergence is far from given.
I’ve been thinking about a similar thing a lot.