Thanks for writing this.
Now, almost a year later, it would be interesting to see how much this has changed. Capabilities progress has generally been quite remarkable, so one is naturally curious how far it extends to writing various coding questions, and also whether an agentic framework like Claude Code would do better on this task than a “naked model”.
Thank you, I will.
There are definitely a lot of things that could be improved in the methodology as well.