Error

LW server reports: not allowed.

This probably means the post has been deleted or moved back to the author's drafts.

Milan W 10 Nov 2024 20:47 UTC
1 point
1
I’d rather say that RLHF+’ed chatbots are upon-reflection-not-so-shockingly sycophantic, since they have been trained to satisfy their conversational partner.
- CTA 21 Jun 2025 16:14 UTC
  1 point
  0
  Parent
  you’re correct; this was a reflection of my own expectations about kindness more than any exceptional qualities of these chatbots. i reread my submission here somewhat recently and thought “oh god, this does not contain the insight i thought it did”.