We can agree that it does not suffice at all to treat an AI program as if it were a human child. But can we also agree that the state of a “grown” AI program will depend on the environment in which it was “raised”?
It may be time to revisit this question. With Owain Evans et al. discovering a generalized “evil vector” in LLMs, and older work like [Pretraining Language Models with Human Preferences](https://www.lesswrong.com/posts/8F4dXYriqbsom46x5/pretraining-language-models-with-human-preferences) that could use a follow-up, AI in the current paradigm seems ripe for some experimentation with parenting practices in pretraining: perhaps affect markers on the text that goes in, or pretraining on children’s literature before moving on to more technically and morally complex text? I haven’t run any experiments of my own, but this doesn’t seem obviously stupid to me.
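To make the two ideas concrete, here is a minimal sketch of what the data-prep side might look like, assuming a simple document-level pretraining pipeline. Everything in it is hypothetical: the tag names, the `affect_score()` heuristic, and the `is_childrens_lit` flag are placeholders I made up for illustration, not anything from the papers linked above.

```python
# Hypothetical sketch: affect-tagging plus "children's literature first"
# curriculum ordering for a pretraining corpus. Placeholder names throughout.

from dataclasses import dataclass


@dataclass
class Document:
    text: str
    is_childrens_lit: bool = False  # e.g. set from source metadata


def affect_score(doc: Document) -> float:
    """Placeholder affect estimate: more negative means more distressing.
    In practice you would swap in a real sentiment/toxicity classifier."""
    negative_markers = ("kill", "hate", "torture")
    hits = sum(doc.text.lower().count(word) for word in negative_markers)
    return -float(hits)


def tag_with_affect(doc: Document) -> str:
    """Prepend a control token describing the document's affect,
    in the spirit of conditional pretraining."""
    tag = "<|benign|>" if affect_score(doc) >= 0.0 else "<|distressing|>"
    return f"{tag} {doc.text}"


def curriculum_order(docs: list[Document]) -> list[Document]:
    """Children's literature first, everything else afterwards."""
    return sorted(docs, key=lambda d: not d.is_childrens_lit)


if __name__ == "__main__":
    corpus = [
        Document("The dictator ordered the army to kill the rebels."),
        Document("The three bears shared their porridge.", is_childrens_lit=True),
    ]
    for doc in curriculum_order(corpus):
        print(tag_with_affect(doc))
```

The point of the tags (as opposed to filtering) is that the model still sees the dark text, it just learns it under a marker it can be conditioned away from at inference time; the ordering step is the curriculum version of the same intuition.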