So 3X more training compute would be a 2X speed-up. Could bump to 3X speed-up due to the additional runtime compute. So overall this would make the slower tail-end of the IE happen 2-3X faster.
I.e. roughly linear.
So this does mean the IE happens faster. I.e. 10 years in 6 months rather than in 12 months.
But i was then commenting on how long it goes on for. Where i think the extra compute makes less difference between once r<1 things slow down fairly quickly. So you maybe still only get ~11 years in 12 months.
Could use the online tool to figure this out. Just do two runs, and in one of them double the ‘initial speed’. That has a similar effect to doubling compute.
Think we agree on that.
My last comment says:
I.e. roughly linear.
So this does mean the IE happens faster. I.e. 10 years in 6 months rather than in 12 months.
But i was then commenting on how long it goes on for. Where i think the extra compute makes less difference between once r<1 things slow down fairly quickly. So you maybe still only get ~11 years in 12 months.
Could use the online tool to figure this out. Just do two runs, and in one of them double the ‘initial speed’. That has a similar effect to doubling compute.