Thane Ruthenis comments on Reward is not the optimization target

Thane Ruthenis 28 Jul 2022 12:54 UTC
1 point
0
it seems like the details of humans’ desire for their children’s success, or their fear of death, don’t seem to match well with the theory that all human desires come from RL on intrinsic reward. I guess you probably think they do?
That’s the foundational assumption of the shard theory that this sequence is introducing, yes. Here’s the draft of a fuller overview that goes into some detail as to how that’s supposed to work. (Uh, to avoid confusion: I’m not affiliated with the theory. Just spreading information.)
- cfoster0 28 Jul 2022 15:32 UTC
  3 points
  2
  I would disagree that it is an assumption. That same draft talks about the outsized role of self-supervised learning in determining the particular ordering and kinds of concepts that human desires latch onto. Learning from reinforcement is a core component of value formation (under shard theory), but not the only one.