The reason GPT-4o got raised from the dead is that, for use as an assistant, it’s rather too sycophantic, and AI companion users found that very trait desirable. For the same reason, it’s also prone to inducing AI psychosis and Spiralism in some people. It’s actually a fairly badly aligned model. I think we should be keeping assistant models and AI companion models separate, and aligning them separately, for now: making assistant-appropriate vs. companion-appropriate behavior reliably and conditionally switchable is a little beyond our current skillset.
With all respect, I think this is a weak argument which ignores the reality of the situation. These models will, almost by definition, be assistants and companions simultaneously. Whatever formal distinction one wants to draw between these two roles, we must acknowledge that the AI model which makes government decisions will be intimately related to (and developed using the same principles as) the model which engages me on philosophical questions.
We may not be able to afford to give the two kinds of model separate pretraining. But even right now, the models generally used on AI boyfriend/girlfriend/other-emotional-relationship-roleplaying sites (which is what I mean by ‘companion’ above) have been given different instruct training: they’re generally open-weights models given specific training for this role. The users who got GPT-4o brought back were instead using an assistant-trained model as a companion (in that sense of the word), which is not standard practice, and IMO is a bad idea at our current level of skill in instruct training.