Hmm, I agree that Paul’s definition is different from mine, but it feels to me like they are both pointing at the same thing.
I think this means that under your definition, behavioral cloning and approval-directed agents are subsets of narrow value learning
That’s right.
whereas under Paul’s definition they are disjoint from narrow value learning.
I’m not sure. I would have included them, because sufficiently good behavioral cloning/approval-directed agents would need to learn instrumental goals and values in order to work effectively in a domain.
was this overloading of the term intentional?
It was intentional, in that I thought that these were different ways of pointing at the same thing.
Hmm, I agree that Paul’s definition is different from mine, but it feels to me like they are both pointing at the same thing.
That’s right.
I’m not sure. I would have included them, because sufficiently good behavioral cloning/approval-directed agents would need to learn instrumental goals and values in order to work effectively in a domain.
It was intentional, in that I thought that these were different ways of pointing at the same thing.