James Hoffend comments on Recent LLMs can do 2-hop and 3-hop latent (no-CoT) reasoning on natural facts

James Hoffend 1 Jan 2026 15:50 UTC
3 points
0
That is very interesting! I am curious about the Opus 4 > Opus 4.5 result. Any guess why the newer model does worse here?
- Adele Lopez 1 Jan 2026 19:17 UTC
  12 points
  1
  It’s a third of the price, so maybe it’s just a smaller model.
  - kaiwilliams 1 Jan 2026 22:55 UTC
    5 points
    0
    I came here to comment the exact same thing. I wonder whether 2-hop latent reasoning performance correlates well with SimpleQA scores.