Error

LW server reports: not allowed.

This probably means the post has been deleted or moved back to the author's drafts.

Vanessa Kosoy 10 Feb 2018 9:50 UTC
LW: 2 AF: 1
0
AF
Delegative Reinforcement Learning solves this problem by keeping humans in the loop while preserving consequentialist reasoning. Ofc currently the theory is based on a lot of simplification and the ultimate learning protocol will probably look differently, but I think that the basic mechanism (delegation combined with model-based reasoning) is sound.