Julian Bradshaw comments on Insights into Claude Opus 4.5 from Pokémon

Julian Bradshaw 10 Dec 2025 2:50 UTC
5 points
0
Please do!
- CronoDAS 11 Dec 2025 21:42 UTC
  16 points
  0
  Parent
  In Claude’s first try, it played Ironclad on Ascension 1 and died to Hexaghost, the Act 1 boss. It wasn’t terrible but occasionally got the mechanics a little bit mixed up.
  
  Here’s the link to the chat history.
  - Julian Bradshaw 12 Dec 2025 5:20 UTC
    4 points
    0
    Parent
    Thanks for following through.
    If anyone wants to make a proper harness in the future, I think probably the most interesting question here is if the LLM can learn from multiple playthroughs, unlocking harder difficulties, etc.
    Modern LLMs, maybe through notetaking?
  - FiftyTwo 16 Dec 2025 9:12 UTC
    3 points
    0
    Parent
    Interesting how much it’s relying on having information in training data and being able to look stuff up. I wonder how it would do with a “blind” play through of a game that didn’t previously exist.