I agree we probably shouldn’t update much on this, it’s from AI and it’s janky.As for beating the game… well sure, but based on the above graphs it seems like Claude will beat the game within about a year?
Other models have beaten the game months ago, but with more advanced harnesses/scaffolds.
I agree we probably shouldn’t update much on this, it’s from AI and it’s janky.
As for beating the game… well sure, but based on the above graphs it seems like Claude will beat the game within about a year?
Other models have beaten the game months ago, but with more advanced harnesses/scaffolds.