james oofou comments on Daniel Kokotajlo’s Shortform

james oofou 21 Feb 2025 13:37 UTC
3 points
0
Grok 3 used maybe 3x more compute than 4o or Gemini and topped Chatbot Arena and many benchmarks despite the facts that xAI was playing catch-up and 3x isn’t that significant since the gain is logorithmic.
I take Grok 3′s slight superiority as evidence for, not against, the importance of scaling hardware.
- aog 21 Feb 2025 16:01 UTC
  2 points
  0
  Parent
  How do we know it was 3x? (If true, I agree with your analysis)
  - james oofou 21 Feb 2025 16:08 UTC
    5 points
    2
    Parent
    Based on Vladimir_Nesov’s calculations:
    https://www.lesswrong.com/posts/WNYvFCkhZvnwAPzJY/go-grok-yourself?commentId=p3nTkpshMq7SmXLjc