Yeah, I get that the actual parameter count isn't known, but I think the general argument is that bigger pre-trains remember more facts, and we can use that to try to predict the model size.
Is there a reliable way to distinguish between [remembers more facts] and [infers more correct facts from remembered ones]? If there isn’t, then using remembered facts as an estimate of base model size would be even more noisy than you’d already expect.
I know I get far more questions right on exams than chance would predict, even when I have zero direct knowledge or memory of the correct answer. I assume reasoning models have at least some of this kind of capability.
Not a reliable source, but I’m open to the possibility (footnote 1)
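For what it's worth, the estimation idea upthread could be sketched as a simple calibration fit: take models whose parameter counts are public, measure their closed-book fact-recall accuracy, fit accuracy against log(parameters), then invert the fit for the unknown model. Everything below is illustrative, the calibration numbers are made up, and (as the reply points out) the estimate inherits all the noise from inference masquerading as memory.

```python
import math

# Hypothetical calibration data: (parameter count, closed-book fact-recall
# accuracy) for models with publicly known sizes. All numbers are invented
# purely for illustration.
calibration = [
    (7e9, 0.42),
    (13e9, 0.48),
    (70e9, 0.61),
    (180e9, 0.68),
]

def fit_log_linear(points):
    """Least-squares fit of accuracy = a * log10(params) + b."""
    xs = [math.log10(p) for p, _ in points]
    ys = [acc for _, acc in points]
    n = len(points)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    b = my - a * mx
    return a, b

def estimate_params(accuracy, a, b):
    """Invert the fit to guess a parameter count from observed accuracy."""
    return 10 ** ((accuracy - b) / a)

a, b = fit_log_linear(calibration)
# A mystery model scoring 0.55 on the same benchmark would land between
# the 13B and 70B calibration points under this (made-up) fit.
guess = estimate_params(0.55, a, b)
print(f"estimated parameters: {guess:.2e}")
```

Note this only works to the extent that recall accuracy is actually a clean function of size, which is exactly what the memorization-vs-inference objection calls into question.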