The biggest problem I have with a lot of these is that they require human feedback. Imagine a chess AI that receives human feedback on each move trying to compete with AlphaZero's self-play RL system, which beat every other chess engine and every human after just 72 hours of training. I just don't see how human-feedback systems can possibly compete.
You can try to separate the feedback: judge the 'ultimate desirability' of consequences apart from the 'practical usefulness' of actions, building the consequence-prediction model solely from experimental data and the value-estimation model solely from human feedback (a rough sketch of this split is below). I think this runs into serious issues: humans have to solve the mixed problem, not the split problem, so it will be difficult for them to give well-split training data.
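To make the proposed split concrete, here is a minimal Python sketch of one way it could be wired up. All names here are hypothetical illustrations, not an existing system: a consequence model fit only to experimental rollouts, a value model fit only to human ratings of outcomes, and an agent that composes them at decision time.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

State = Tuple[float, ...]
Action = int


@dataclass
class SplitAgent:
    # Trained only on observed (state, action) -> next-state data; no human input.
    consequence_model: Callable[[State, Action], State]
    # Trained only on human ratings of outcomes; no environment dynamics.
    value_model: Callable[[State], float]

    def choose(self, state: State, actions: List[Action]) -> Action:
        # Compose the two models: predict each action's outcome, then score it.
        return max(
            actions,
            key=lambda a: self.value_model(self.consequence_model(state, a)),
        )


if __name__ == "__main__":
    agent = SplitAgent(
        consequence_model=lambda s, a: (s[0] + a,),  # stand-in dynamics model
        value_model=lambda s: -abs(s[0] - 3.0),      # stand-in learned human preference
    )
    # Picks the action whose *predicted* outcome the value model scores highest.
    print(agent.choose((0.0,), actions=[0, 1, 2, 3]))  # -> 3
```

The worry above is about the training data for `value_model`: when humans rate an outcome "in isolation," their judgments tend to be entangled with how they would act to reach it, so the labels leak mixed-problem information back into the supposedly split component.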
As well, having a solution that's "real but expensive" would be a genuine step up from having no solution!