jessicata comments on Some problems with making induction benign, and approaches to them

jessicata 10 Mar 2017 22:34 UTC
0 points
AF
Agree that IRL doesn’t solve this problem (it just bumps it to another level).

The second tier thing sounds a lot like KWIK learning. I think this is a decent approach if we’re fine with only learning instrumental goals and are using a bootstrapping procedure.
- Vanessa Kosoy 18 Mar 2017 14:04 UTC
  0 points
  KWIK learning is definitely related in the sense that we want to follow a “conservative” policy that is risk averse w.r.t. its uncertainty regarding the utility function, which is similar to how KWIK learning doesn’t produce labels about which it is uncertain. Btw, do you know which of the open problems in the Li-Littman-Walsh paper are solved by now?
  - jessicata 18 Mar 2017 20:37 UTC
    0 points
    I don’t know which open problems have been solved.
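
For readers unfamiliar with the KWIK ("knows what it knows") setting mentioned in this thread, here is a minimal sketch of the abstain-when-uncertain behavior being compared to a conservative policy. It assumes a finite hypothesis class, noise-free labels, and a target that lies in the class; the names `EnumerationKWIK` and `make_threshold` and the threshold hypotheses are illustrative choices, not anything taken from the post or the comments. The learner outputs a label only when every hypothesis still consistent with past data agrees, and otherwise returns `None` ("I don't know") and asks for the true label.

```python
from typing import Callable, Iterable, List, Optional

Hypothesis = Callable[[int], bool]


class EnumerationKWIK:
    """Toy KWIK learner over a finite, deterministic hypothesis class."""

    def __init__(self, hypotheses: Iterable[Hypothesis]) -> None:
        # Version space: hypotheses not yet contradicted by an observed label.
        self.version_space: List[Hypothesis] = list(hypotheses)

    def predict(self, x: int) -> Optional[bool]:
        """Return a label if all remaining hypotheses agree, else None."""
        predictions = {h(x) for h in self.version_space}
        if len(predictions) == 1:
            return predictions.pop()
        return None  # "I don't know": the learner abstains

    def observe(self, x: int, label: bool) -> None:
        """Drop every hypothesis that disagrees with the revealed label."""
        self.version_space = [h for h in self.version_space if h(x) == label]


def make_threshold(t: int) -> Hypothesis:
    # Hypothetical hypothesis family: label is True iff x >= t.
    return lambda x: x >= t


learner = EnumerationKWIK(make_threshold(t) for t in range(5))
target = make_threshold(3)  # assumed to be in the hypothesis class

abstentions = 0
for x in [0, 4, 2, 3, 1, 2, 3]:
    guess = learner.predict(x)
    if guess is None:
        # The label is only revealed when the learner abstains.
        abstentions += 1
        learner.observe(x, target(x))
    else:
        # Under the assumptions above, a committed prediction is never wrong.
        assert guess == target(x)
```

Each abstention eliminates at least one hypothesis, so under these assumptions the number of "I don't know" answers is at most the size of the class minus one. The analogy to a risk-averse policy is that the learner never commits while its remaining uncertainty could change the answer.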