I will fold on the general point here, it is mostly the case that it doens’t matter and the motivations come from the steering sub-system anyhow and that as a consqeuence it is ounfdationally different from how LLMs learn.
There is obviously no culture on Earth where people are kind and honest because it has simply never occurred to any of them that they could instead be mean or dishonest. So prosociality cannot be a “default assumption”. Instead, it’s a choice that people make every time they interact with someone, and they’ll make that choice based on their all-things-considered desires. Right? Sorry if I’m misunderstanding.
I’m however not certain if I agree with this point, if your in a fully cooperative game, is it your choice that you choose to cooperate? If you’re an agent who uses functional or evidential decision theory and you choose to cooperate with your self in a black box prisoner’s dilemma is that really a choice then?
Like your initial imitations shape your steering system to some extent and so there could be culturally learnt social drives no? I think culture might be conditioning the intial states of your learning environment and that still might be an important part of how social drives are generated?
I hope that makes sense and I apologise if it doesn’t.
I will fold on the general point here, it is mostly the case that it doens’t matter and the motivations come from the steering sub-system anyhow and that as a consqeuence it is ounfdationally different from how LLMs learn.
I’m however not certain if I agree with this point, if your in a fully cooperative game, is it your choice that you choose to cooperate? If you’re an agent who uses functional or evidential decision theory and you choose to cooperate with your self in a black box prisoner’s dilemma is that really a choice then?
Like your initial imitations shape your steering system to some extent and so there could be culturally learnt social drives no? I think culture might be conditioning the intial states of your learning environment and that still might be an important part of how social drives are generated?
I hope that makes sense and I apologise if it doesn’t.