I like your idea that economic incentives will become the safety bottleneck more so than corrigibility. Many would argue that even a pure reasoner can influence the world, e.g. through manipulation, but this doesn't seem very realistic to me if the model is memoryless and lacks the ability to recursively pose itself new questions. Adding such capabilities is fairly easy, however, which is exactly what your concern is about.