DragonGod comments on What’s wrong with the paperclips scenario?

DragonGod 7 Jan 2023 20:13 UTC
8 points
3
We probably wouldn’t uncritically let loose an AI whose capability was to maximise the quantity of some physical stuff (paperclips, stamps, etc.). If we make a (very) stupid outer alignment failure, we’re more likely to train an AI to maximise “happiness” or similar.
- No77e 7 Jan 2023 20:16 UTC
  6 points
  2
  Parent
  I agree with you here, although something like “predict the next token” seems more and more likely. Although I’m not sure if this is in the same class of goals as paperclip maximizing in this context, and if the kind of failure it could lead to would be similar or not.