If you would like the LLM to be truly creative, then check out the Science Bench, where the problems stump SOTA LLMs despite the fact that the LLMs have read nearly every book on every subject. Or EpochAI's recent results.
I mean, GPT-5 getting 43% of PhD problems right isn’t particularly bad. I don’t know about making new insights but it doesn’t seem like it would be unachievable (especially as it’s possible that prompting/tooling/agent scaffolding might compensate for some of the problems).
I often think about whether LLM systems can come up with societal/scientific breakthroughs.
My intuition is that they can, and that they don't need to be bigger, have more training data, or have a different architecture in order to do so.
Starting to keep a diary along these lines here: https://docs.google.com/document/d/1b99i49K5xHf5QY9ApnOgFFuvPEG8w7q_821_oEkKRGQ/edit?usp=sharing
Science Bench is made by one Christian Stump. LLMs are literally stumped.
Thanks for sending Science Bench in particular.