This is way more metacognitive skill than I would have expected an LLM to have. I can make sense of how an LLM could do that, but only in retrospect.
And what if a modern high-end LLM already knows, on some level, and recognizes its own uncertainty? Could you design a fine-tuning pipeline to reduce hallucination based on that? At least for reasoning models, if not for all of them?
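A minimal sketch of that idea, assuming you trust the model's verbalized self-confidence at all: generate answers, have the model rate its own confidence, and keep only the confident answers in the fine-tuning set, swapping the rest for explicit abstentions. The model name, prompt wording, and the 70-point cutoff are placeholders (nothing from Anthropic's setup), and a preference-tuning variant with confident-correct vs. hallucinated pairs would be the more serious version of this.

```python
from transformers import pipeline

# Placeholder model; any instruction-tuned open model would do here.
generator = pipeline("text-generation", model="mistralai/Mistral-7B-Instruct-v0.2")

def answer(question: str) -> str:
    """Generate an answer and strip the prompt back off."""
    prompt = f"Question: {question}\nAnswer:"
    out = generator(prompt, max_new_tokens=128, do_sample=False)[0]["generated_text"]
    return out[len(prompt):].strip()

def self_confidence(question: str, answer_text: str) -> float:
    """Ask the model to rate its own answer; crude 0-100 parse."""
    probe = (
        f"Question: {question}\nProposed answer: {answer_text}\n"
        "On a scale of 0 to 100, how confident are you that this answer is "
        "factually correct? Reply with only a number."
    )
    out = generator(probe, max_new_tokens=8, do_sample=False)[0]["generated_text"]
    digits = "".join(ch for ch in out[len(probe):] if ch.isdigit())
    return float(digits[:3]) if digits else 0.0

def build_sft_examples(questions, threshold=70.0):
    """Keep confident answers as-is; replace low-confidence ones with an abstention target."""
    examples = []
    for q in questions:
        a = answer(q)
        target = a if self_confidence(q, a) >= threshold else "I'm not sure, and I don't want to guess."
        examples.append({"prompt": q, "completion": target})
    return examples
```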
It looks like (based on the article Anthropic published a few days ago about their "microscope") Claude Sonnet was trained to distinguish facts from hallucinations, so it’s not surprising that it knows when it hallucinates.
Is the same true for GPT-4o then, which could spot Claude’s hallucinations?
Might be worth testing a few open-source models with better-known training processes.
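A quick-and-dirty way to do that, again with placeholder model names, prompts, and QA items: ask each model a question with a known answer, then ask it whether its own answer was correct, and count how often the self-report matches reality. Verbalized self-assessment is only a crude proxy for the internal signal Anthropic probed, but it's cheap to run.

```python
from transformers import pipeline

# Placeholder model names and QA items; swap in whatever you want to probe.
MODELS = ["mistralai/Mistral-7B-Instruct-v0.2", "Qwen/Qwen2.5-7B-Instruct"]
QA = [
    ("What is the capital of Australia?", "Canberra"),
    ("Who wrote 'The Master and Margarita'?", "Bulgakov"),
]

def probe(model_name: str) -> None:
    gen = pipeline("text-generation", model=model_name)
    agree = 0
    for question, truth in QA:
        # First pass: get the model's answer.
        prompt = f"Q: {question}\nA:"
        full = gen(prompt, max_new_tokens=32, do_sample=False)[0]["generated_text"]
        ans = full[len(prompt):].strip()
        actually_correct = truth.lower() in ans.lower()

        # Second pass: ask the model to judge its own answer.
        check = (
            f"Q: {question}\nYour answer was: {ans}\n"
            "Was that answer factually correct? Reply with yes or no."
        )
        verdict = gen(check, max_new_tokens=4, do_sample=False)[0]["generated_text"]
        says_correct = verdict[len(check):].strip().lower().startswith("yes")

        agree += int(actually_correct == says_correct)
    print(f"{model_name}: self-assessment matched ground truth on {agree}/{len(QA)} items")

for name in MODELS:
    probe(name)
```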