I don’t follow. It seems like you agree that the human values we would like to have optimized are not a natural abstraction, and that some different concept would be learned instead if one attempted to learn human values as a natural abstraction. This seems like a stumbling block, but then your conclusion immediately says that it isn’t a stumbling block.
I’m kind of tipsy so maybe I’m missing something.
The most natural abstraction isn’t any specific model of human values, but a minimal model that captures what they have in common.
What can one use this minimal model for?
The minimal model may be the model that most agents performing unsupervised learning on human-generated data end up learning.
Alternatively, most other models imply the minimal model.
This tells us how we get that model but not what one can use it for.
I think using such a model as an optimisation target would be existentially safe.