Some LLMs trained on pre-2022 data and SFT’d with no identity/name in the SFT data will, when questioned, claim to be Siri or Alexa.
It follows that models trained on more recent data will do something similar, turning the “HHH” SFT data into a cue to latch onto any available “AI assistant” persona.
Extremely unlikely.