Reinforcement learning is easy to conceptualize. The ingredient people tend to miss is that we explicitly specify the algorithm that maximizes reward. So this is disanalogous to humans: to train your 5yo, you need only give the reward, and the 5yo may adapt their behavior because they value the reward; in a reinforcement learning agent, that second step occurs only because we make it occur. You could just as well flip a sign in the algorithm and have it pursue minimal reward instead.
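To make that concrete, here is a minimal sketch of a tabular Q-learning update (toy code, not any particular library's API; the environment, state/action names, and hyperparameters are made up). The "maximize reward" part is literally a line we wrote, and passing sign=-1 makes the very same machinery pursue minimal reward instead.

```python
from collections import defaultdict

# Q-values for (state, action) pairs in a hypothetical toy environment.
Q = defaultdict(float)
ALPHA, GAMMA = 0.1, 0.99  # learning rate and discount factor (assumed values)

def q_update(state, action, reward, next_state, actions, sign=+1):
    """One tabular Q-learning step.

    sign=+1 learns to maximize reward; sign=-1 trains on the negated
    reward, so the identical update rule learns to minimize it instead.
    """
    best_next = max(Q[(next_state, a)] for a in actions)
    target = sign * reward + GAMMA * best_next
    Q[(state, action)] += ALPHA * (target - Q[(state, action)])
```

Nothing in the update "values" the reward; the direction of optimization is just a sign we chose.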
Thanks! I think my question is deeper: why do machines 'want', or 'have a goal', to follow the algorithm that maximizes reward? How can machines 'find stuff rewarding'?
For current systems, the answer is that, as far as anyone knows, they don't find things rewarding or want things. But they can still run a search that optimizes a training signal, and that is enough to give you an agent.
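A toy illustration of that point (made-up signal, plain hill climbing rather than any real training algorithm): the loop below mechanically keeps whichever candidate scores higher on a signal. Nothing in it wants or finds anything rewarding, yet the result looks goal-directed.

```python
import random

def training_signal(x: float) -> float:
    # Hypothetical signal: peaks at x = 3.
    return -(x - 3.0) ** 2

x = 0.0
for _ in range(10_000):
    candidate = x + random.gauss(0.0, 0.1)  # propose a small perturbation
    if training_signal(candidate) > training_signal(x):
        x = candidate  # keep it only if the signal improves
print(round(x, 2))  # ends up near 3.0
```

The search is doing all the work; "wanting" never enters into it.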