As I’d said, I think he’s right about the o-series’ theoretic potential. I don’t think there is, as of yet, any actual indication that this potential has already been harnessed, and therefore that it works as well as the theory predicts. (And of course, the o-series scaling quickly at math is probably not even an omnicide threat. There’s an argument for why it might be – that the performance boost will transfer to arbitrary domains – but that doesn’t seem to be happening. I guess we’ll see once o3 is public.)
I dont think any of that invalidates that Gwern is a usual, usually right.
As I’d said, I think he’s right about the o-series’ theoretic potential. I don’t think there is, as of yet, any actual indication that this potential has already been harnessed, and therefore that it works as well as the theory predicts. (And of course, the o-series scaling quickly at math is probably not even an omnicide threat. There’s an argument for why it might be – that the performance boost will transfer to arbitrary domains – but that doesn’t seem to be happening. I guess we’ll see once o3 is public.)