Actual full-blown fraud in frontier models at the big labs (oai/anthro/gdm) seems very unlikely. Accidental contamination is a lot more plausible, but people are incentivized to find metrics that avoid it. Evals not measuring real-world usefulness is the obvious culprit imo, and it’s one big reason my timelines have been somewhat longer despite rapid progress on evals.
Why does it seem very unlikely?
Those conspiracies don’t work most of the time: “you can only keep a secret between two people, provided one of them is dead.”
The personal risk for anyone involved + the human psychological tendency to chat and the difficulty of keeping secrets forever mean it’s usually irrational for these organisations to cheat intentionally.