paulfchristiano comments on Asymptotically Unambitious AGI

paulfchristiano 8 Mar 2019 4:55 UTC
LW: 5 AF: 3
> For the asymptotic results, one has to consider environments that produce observations with the true objective probabilities (hence the appearance that I’m unconcerned with competitiveness). In practice, though, given the speed prior, the agent will require evidence to entertain slow world-models, and for the beginning of its lifetime, the agent will be using low-fidelity models of the environment and the human-explorer, rendering it much more tractable than a perfect model of physics. And I think that even at that stage, well before it is doing perfect simulations of other humans, it will far surpass human performance. We manage human-level performance with very rough simulations of other humans.
I’m keen on asymptotic analysis, but if we want to analyze safety asymptotically I think we should also analyze competitiveness asymptotically. That is, if our algorithm only becomes safe in the limit because we shift to a super uncompetitive regime, it undermines the use of the limit as an analogy to study the finite-time behavior.
(This is not the most interesting disagreement, though; it is probably not worth responding to anything other than the thread where I ask “why do you need this memory stuff?”)
- michaelcohen 8 Mar 2019 6:37 UTC
  LW: 3 AF: 2
  > That is, if our algorithm only becomes safe in the limit because we shift to a super uncompetitive regime, it undermines the use of the limit as an analogy to study the finite-time behavior.
  Definitely agree. I don’t think a shift to super uncompetitiveness is actually an “ingredient” of benignity, but my only discussion of that so far is in the conclusion: “We can only offer informal claims regarding what happens before BoMAI is definitely benign...”