David Duvenaud comments on The Risk of Gradual Disempowerment from AI

David Duvenaud 8 Feb 2025 13:46 UTC
3 points
0
Great question. I think treacherous turn risk is still under-funded in absolute terms. And gradual disempowerment is much less shovel-ready as a discipline.

I think there are two reasons why maybe this question isn’t so important to answer:
1) The kinds of skills required might be somewhat disjoint.
2) Gradual disempowerment is perhaps a subset or extension of the alignment problem. As Ryan Greenblatt and others point out: at some point, agents aligned to one person or organization will also naturally start working on this problem at the object level for their principals.