paulfchristiano comments on Asymptotically Unambitious AGI

paulfchristiano 9 Mar 2019 23:57 UTC
LW: 5 AF: 3
0
AF
I’m sympathetic to this picture, though I’d probably be inclined to try to model it explicitly—by making some assumption about what the planning algorithm can actually do, and then showing how to use an algorithm with that property. I do think “just write down the algorithm, and be happier if it looks like a ‘normal’ algorithm” is an OK starting point though
Given that the setup is basically a straight reinforcement learner with a weird prior, I think that at that level of abstraction, the ceiling of competitiveness is quite high.
Stepping back from this particular thread, I think the main problem with competitiveness is that you are just getting “answers that look good to a human” rather than “actually good answers.” If I try to use such a system to navigate a complicated world, containing lots of other people with more liberal AI advisors helping them do crazy stuff, I’m going to quickly be left behind.
It’s certainly reasonable to try to solve safety problems without attending to this kind of competitiveness, though I think this kind of asymptotic safety is actually easier than you make it sound (under the implicit “nothing goes irreversibly wrong at any finite time” assumption).
What links here?
- michaelcohen's comment on Asymptotically Unambitious AGI by michaelcohen (1 Apr 2019 23:39 UTC; 13 points)
- michaelcohen's comment on Asymptotically Unambitious AGI by michaelcohen (10 Mar 2019 3:38 UTC; 7 points)
- michaelcohen 10 Mar 2019 3:39 UTC
  LW: 3 AF: 2
  0
  AF Parent
  Starting a new thread on this:
  Stepping back from this particular thread, I think the main problem with competitiveness is that you are just getting “answers that look good to a human” rather than “actually good answers.”
  here.