michaelcohen comments on Asymptotically Unambitious AGI

michaelcohen 6 Mar 2019 1:16 UTC
0 points
Comment thread: minor concerns
- David Scott Krueger (formerly: capybaralet) 1 Apr 2019 0:53 UTC
  2 points
  Parent
  Just exposition-wise, I’d front-load pi^H and pi^* when you define pi^B, and also clarify then that pi^B considers human-exploration as part of it’s policy.
- David Scott Krueger (formerly: capybaralet) 1 Apr 2019 0:52 UTC
  1 point
  Parent
  ″ This result is independently interesting as one solution to the problem of safe exploration with limited oversight in nonergodic environments, which [Amodei et al., 2016] discus ”
  ^ This wasn’t super clear to me.… maybe it should just be moved somewhere else in the text?
  I’m not sure what you’re saying is interesting here. I guess it’s the same thing I found interesting, which is that you can get sufficient (and safe-as-a-human) exploration using the human-does-the-exploration scheme you propose. Is that what you mean to refer to?
  - michaelcohen 1 Apr 2019 1:16 UTC
    1 point
    Parent
    Yeah that’s what I mean to refer to: this is a system which learns everything it needs to from the human while querying her less and less, which makes human-lead exploration viable from a capabilities standpoint. Do you think that clarification would make things clearer?
- David Scott Krueger (formerly: capybaralet) 1 Apr 2019 0:16 UTC
  1 point
  Parent
  ETA: NVM, what you said is more descriptive (I just looked in the appendix).
  RE footnote 2: maybe you want to say “monotonically increasing as a function of” rather than “proportional to”. (It’s a shame there doesn’t seem to be a shorter way of saying the first one, which seems to be more often what people actually want to say...)
  - David Scott Krueger (formerly: capybaralet) 1 Apr 2019 0:19 UTC
    1 point
    Parent
    Maybe “promotional of” would be a good phrase for this.
- Pattern 6 Mar 2019 2:24 UTC
  0 points
  Parent
  Is this where typos go?
  - ErickBall 6 Mar 2019 13:05 UTC
    1 point
    Parent
    Typo: some of the hover-boxes say nu but seem to be referring to the letter mu.
    - michaelcohen 7 Mar 2019 0:12 UTC
      1 point
      Parent
      Thank you, I’ll have to clarify that. For now, $ν$ is a general world-model, and $μ$ is a specific one, so in the hover text, I explain the notation with a general case. But I see how that’s confusing.
  - michaelcohen 6 Mar 2019 3:17 UTC
    1 point
    Parent
    Yes, but this is also for things that seem like mistakes in the exposition, but either have simple fixes or don’t impact the main theorems.