My sense is that I'd start with a higher r value because the LLM case looks faster (and I don't feel the need to adjust downward in a few places the way you do in the computer vision case). Obviously the numbers in the LLM case are much less certain given that I'm guessing based on qualitative improvement and looking at some open source models, but being closer to what we actually care about maybe outweighs this.
I also think I'd make a slightly smaller update in the diminishing returns case, since I think there's a good chance that returns diminish substantially more sharply as you get closer and closer to the limits, rather than r decreasing linearly (based on some first principles reasoning and my understanding of how returns diminished in the semi-conductor case).
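To make the contrast concrete, here's a minimal sketch of the two shapes being compared. The specific functional forms are my own illustrative assumptions, not anyone's actual model: one has r falling linearly to zero as cumulative progress approaches a limit, the other has r collapsing much more steeply near the limit.

```python
def r_linear(progress, r0=1.0, limit=10.0):
    """r falls linearly to zero as progress approaches the limit."""
    return max(0.0, r0 * (1 - progress / limit))

def r_sharp(progress, r0=1.0, limit=10.0, k=3.0):
    """r falls off much more steeply near the limit
    (power law in the remaining headroom; k is an assumed exponent)."""
    remaining = max(0.0, 1 - progress / limit)
    return r0 * remaining ** k

# The two forms agree far from the limit but diverge sharply near it.
for p in [0.0, 5.0, 9.0, 9.9]:
    print(p, round(r_linear(p), 4), round(r_sharp(p), 4))
```

Under the sharp form, most of the slowdown arrives late, which is why it would imply a smaller update from the same early-stage evidence.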
But the biggest delta is that I think I wasn’t pricing in the importance of increasing capabilities. (Which seems especially important if you apply a large R&D parallelization penalty.)
Obviously the numbers in the LLM case are much less certain given that I’m guessing based on qualitative improvement and looking at some open source models,
Sorry, I don't follow why they're less certain?
based on some first principles reasoning and my understanding of how returns diminished in the semi-conductor case
I'd be interested to hear more about this. The semiconductor case is hard because we don't know how far we are from the limits, but if we use Landauer's limit then I'd guess you're right. There's also uncertainty about how much algorithmic progress we will make and have made.
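For reference, the Landauer bound itself is straightforward to compute from the Boltzmann constant; the choice of 300 K (room temperature) is an assumption, and this says nothing about how close current hardware actually is to it:

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact, 2019 SI definition)
T = 300.0           # assumed operating temperature in kelvin

# Landauer's limit: minimum energy dissipated per bit erased, k_B * T * ln(2)
landauer_j_per_bit = K_B * T * math.log(2)
print(f"{landauer_j_per_bit:.3e} J per bit erased")  # ~2.871e-21 J
```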
I'm just eyeballing the rate of algorithmic progress, while in the computer vision case we can at least look at benchmarks and know the training compute cost for various models.
My sense is that you have generalization issues in the computer vision case, while in the frontier LLM case you have issues with knowing the actual numbers (in terms of number of employees and cost of training runs). I'm also just not carefully doing the accounting.
I’d be interested to hear more about this.
I don’t have much to say here sadly, but I do think investigating this could be useful.