If one’s interpretation of the ‘objective’ of the agent is full of piecewise statements and ad-hoc cases, then what exactly are we gaining by describing it as maximizing an objective in the first place? You might as well describe a calculator by saying that it’s maximizing the probability of outputting the following: [write out the source code that leads to its outputs]. At some point the model breaks down, and the idea that the agent is following an objective becomes completely epiphenomenal to its actual operation. Saying that it’s maximizing an objective sheds no more light on its internal operations than just spelling out exactly what its source code is.
I don’t feel like you’re really understanding what I’m trying to say here. I’m happy to chat about this more over video call or something if you’re interested.
Sure, we can talk about this over video. Check your Facebook messages.