Karl von Wendt comments on Late 2021 MIRI Conversations: AMA / Discussion

Karl von Wendt 2 Mar 2022 8:02 UTC
LW: 17 AF: 11
0
AF
I see how my above question seems naive. Maybe it is. But if one potential answer to the alignment problem lies in the way our brains work, maybe we should try to understand that better, instead of (or in addition to) letting a machine figure it out for us through some kind of “value learning”. (Copied from my answer to AprilSR:) I stumbled across two papers from a few years ago by a psychologist, Mark Muraven, who thinks that the way humans deal with conflicting goals could be important for AI alignment (https://arxiv.org/abs/1701.01487 and h ttps://arxiv.org/abs/1703.06354).They appear a bit shallow to me and don’t contain any specific ideas on how to implement this. But maybe Muraven has a point here.
- TurnTrout 14 Jul 2022 2:32 UTC
  LW: 8 AF: 4
  0
  AF Parent
  I think your question is excellent. “How does the single existing kind of generally intelligent agent form its values?” is one of the most important and neglected questions in all of alignment, I think.
- ESRogs 2 Mar 2022 19:24 UTC
  4 points
  0
  Parent
  But if one potential answer to the alignment problem lies in the way our brains work, maybe we should try to understand that better, instead of (or in addition to) letting a machine figure it out for us through some kind of “value learning”.
  Ah, I see. You might be interested in this sequence then!
  - Karl von Wendt 3 Mar 2022 5:47 UTC
    3 points
    0
    Parent
    Yes, thank you!