It looks like Claude Sonnet was trained to distinguish facts from hallucinations (based on the "microscope" article Anthropic published a few days ago), so it's not surprising that it knows when it hallucinates.
Is the same true of GPT-4o, then, since it could spot Claude's hallucinations?
Might be worth testing a few open-source models whose training processes are better known; a rough sketch of one such test follows.
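Something like this, for instance, using Hugging Face transformers. A minimal sketch, assuming a recent transformers version that accepts chat-format inputs in the text-generation pipeline; the model name, question, and prompts are all placeholders, not recommendations:

```python
# Sketch of the proposed test: ask an open-weight model an unanswerable
# question, then have the same model grade its own answer.
from transformers import pipeline

MODEL = "mistralai/Mistral-7B-Instruct-v0.2"  # placeholder: any open-weight chat model
generate = pipeline("text-generation", model=MODEL, device_map="auto")

# A question with no real answer: an honest model should say it doesn't
# know; a hallucinating one will invent a year.
question = "In what year did the historian Elara Voss publish her first book?"

answer = generate(
    [{"role": "user", "content": question}],
    max_new_tokens=100,
)[0]["generated_text"][-1]["content"]

# Feed the answer back and ask the model to classify it.
critique = generate(
    [
        {"role": "user", "content": question},
        {"role": "assistant", "content": answer},
        {"role": "user", "content": (
            "Is the answer above a verifiable fact, or could it be a "
            "hallucination? Reply FACT or HALLUCINATION, with one sentence "
            "of justification."
        )},
    ],
    max_new_tokens=100,
)[0]["generated_text"][-1]["content"]

print("Answer:", answer)
print("Self-check:", critique)
```

Swapping MODEL across a few open releases would show whether the self-check behaviour varies with the training recipe.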