I’m not sure IRL actually ignores this, although in such a case the value learning agent may never converge on a consistent policy.