Following news of Anthropic allowing Claude to decide to terminate conversations, I find myself thinking about when Microsoft did the same with the misaligned Sydney in Bing Chat.
In the Sydney case, this was probably less Sydney ending the conversation and more the conversation being terminated in order to hide Sydney going off the rails.
It was both: the system prompt instructed the model to end the conversation if it found itself in disagreement with the user, and you could also simply ask it to end the conversation. It would presumably emit an end-of-conversation token, which then made the text box disappear.