Shmi comments on Seriously, what goes wrong with “reward the agent when it makes you smile”?

Shmi 11 Aug 2022 23:55 UTC
2 points
0
Somewhat unrelated and probably silly… Why reward the agent directly instead of letting it watch humans act in their natural environment and leaving it to build a predictive model of humans?
- green_leaf 12 Aug 2022 7:18 UTC
  1 point
  0
  To predict if a human ends up happy with something or not?