Yup, the second go-round with Project Vend was a lot better, almost up to “disastrous 1999 dotcom” levels of management. Even including the bizarre late-night emails from the “CEO model” full of weird, enthusiastic spiritual rants.
I should be clear: Opus 4.5 is a very large piece of a general intelligence. And it’s getting better rapidly. But it’s still missing some really critical stuff, too.
Also, my job doesn’t come with a lot of built-in affordances, except the ones that I set up. On the one hand, giving Opus 4.5 a CLI sandbox gives it a lot of options for setting up CLI accounting software, etc. On the other hand, even Gemini still struggles with Pokemon video games, despite some heavy duty affordances like a map-management tool. A key part of being a general intelligence is being able to function without too much hand holding, basically.
Yup, the second go-round with Project Vend was a lot better, almost up to “disastrous 1999 dotcom” levels of management. Even including the bizarre late-night emails from the “CEO model” full of weird, enthusiastic spiritual rants.
I should be clear: Opus 4.5 is a very large piece of a general intelligence. And it’s getting better rapidly. But it’s still missing some really critical stuff, too.
Also, my job doesn’t come with a lot of built-in affordances, except the ones that I set up. On the one hand, giving Opus 4.5 a CLI sandbox gives it a lot of options for setting up CLI accounting software, etc. On the other hand, even Gemini still struggles with Pokemon video games, despite some heavy duty affordances like a map-management tool. A key part of being a general intelligence is being able to function without too much hand holding, basically.