jessicata comments on Nearest unblocked strategy versus learning patches

jessicata 27 Feb 2017 20:31 UTC
0 points
AF
This seems pretty similar to this proposal. Does that seem right to you?

I think my main objection is the same as the main objection to the proposal I linked to: there has to be a good prior over “what the correct judgments are” such that when this prior is updated on data, it correctly generalizes to cases where we can’t get feedback even in principle. It’s not even clear what “correct judgments” means (you can’t put a human in a box and have them think for 500 years).
- Stuart_Armstrong 28 Feb 2017 11:50 UTC
  0 points
  AF
  Not exactly that. What I’m trying to get at is that we know some of the features that failure would have (e.g. edge cases of utility maximisation, seductive-seeming or seductively presented answers), so we should be able to use that knowledge somehow.