Thanks for this feedback, this was exactly the sort of response I was hoping for!
You say you disagree where identity comes from, but then I can’t tell where the disagreement is? Reading what you wrote, I just kept nodding along being like ‘yep yep exactly.’ I guess the disagreement is about whether the identity comes from the RL part (step 3) vs. the instruction training (step 2); I think this is maybe a merely verbal dispute though? Like, I don’t think there’s a difference in kind between ‘imposing a helpful assistant from x constraint’ and ‘forming a single personality,’ it’s just a difference of degree.
Thanks for this feedback, this was exactly the sort of response I was hoping for!
You say you disagree where identity comes from, but then I can’t tell where the disagreement is? Reading what you wrote, I just kept nodding along being like ‘yep yep exactly.’ I guess the disagreement is about whether the identity comes from the RL part (step 3) vs. the instruction training (step 2); I think this is maybe a merely verbal dispute though? Like, I don’t think there’s a difference in kind between ‘imposing a helpful assistant from x constraint’ and ‘forming a single personality,’ it’s just a difference of degree.