Please do!
In Claude’s first try, it played Ironclad on Ascension 1 and died to Hexaghost, the Act 1 boss. It wasn’t terrible but occasionally got the mechanics a little bit mixed up.
Here’s the link to the chat history.
Thanks for following through.
If anyone wants to make a proper harness in the future, I think the most interesting question here is whether the LLM can learn from multiple playthroughs, unlocking harder difficulties, etc.
Modern LLMs, maybe through note-taking?
Interesting how much it’s relying on having information in training data and being able to look stuff up. I wonder how it would do with a “blind” playthrough of a game that didn’t previously exist.