being trained on “follow instructions”
What does this actually mean, in terms of the details of how you’d train a model to do this?
Take a big language model like GPT-3 and train it via RL on tasks where it's given a natural-language instruction from a human, and it gets reward if the human judges that it has done the task successfully.
Makes sense, thanks!
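[Editor's note: concretely, the loop described above might look like the minimal sketch below. Everything in it is a toy stand-in rather than any real library's API: the "policy" is a per-token weight table instead of a large pretrained model, the human rater is hard-coded, and the update is a crude bandit-style reward-weighted adjustment rather than a full policy-gradient method.]

```python
# Toy sketch of "RL from human judgments of instruction-following".
# Hypothetical throughout: a real setup would start from a large
# pretrained language model (e.g. GPT-3) and use a proper RL algorithm.
import random

VOCAB = ["yes", "no", "maybe"]

# Toy "policy": one sampling weight per token.
policy = {tok: 1.0 for tok in VOCAB}

def sample_response():
    """Sample one token in proportion to the policy's current weights."""
    total = sum(policy.values())
    r = random.uniform(0, total)
    for tok, w in policy.items():
        r -= w
        if r <= 0:
            return tok
    return VOCAB[-1]

def human_reward(instruction, response):
    """Stand-in for a human rater: reward 1 if the rater thinks the
    instruction was followed, else 0. Here the judgment is hard-coded."""
    return 1.0 if response == "yes" else 0.0

LEARNING_RATE = 0.1

for step in range(1000):
    instruction = "Answer yes."                    # instruction from a human
    response = sample_response()                   # model attempts the task
    reward = human_reward(instruction, response)   # human judges success
    # Reward-weighted update: responses the human rewarded gain weight,
    # unrewarded ones lose weight (clamped so weights stay positive).
    policy[response] = max(1e-3, policy[response] + LEARNING_RATE * (reward - 0.5))

print(policy)  # the weight on "yes" should dominate after training
```

[In practice the update would be a real policy-gradient step (e.g. PPO) on the model's log-probabilities, but the shape of the loop is the same: sample a response, get a human judgment, reinforce what the human rewarded.]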