Two AI accelerants I had not noticed before:
Users with the hardest problems will use the smartest model available. So if you release your best model and your competitor only releases their 2nd smartest, then you will get better training data from users. Not just code, but vision and robotics too.
Among those affected by AI job loss are AI/ML devs/researchers themselves. The higher the hiring bar for AI devs, (A) the more applicants will put effort into learning stuff deeply and doing interesting, useful, novel work to get a job, and (B) the easier it is to spot/snag unusual talent.
FWIW, my experience was that the utility of user data was always much higher in promise than in actual outcomes. This might have changed over time though.
There’s a lot of “they used user data to shoot themselves in the foot” and not nearly enough “they used user data to improve performance” happening in the industry.
Maybe frontier labs will finally crack applying user feedback once the training data bottleneck begins to bite? I imagine that getting good utility out of user data is hard, both in terms of the engineering and the computation required.
Factor 1 is true only to the degree that models cannot effectively generate hard problems for themselves to solve. If they can generate problems with verifiable rewards just at the edge of their capabilities, similar to AlphaZero, I expect those to be more useful than human-generated problems. For example, there are plenty of incredibly difficult unsolved conjectures in math, but they provide no RL gradient for labs to train on, since the model essentially never solves them and so never receives a reward signal.
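As a rough illustration of that AlphaZero-style loop (a minimal sketch, not any lab's actual pipeline; the `propose_problem` and `solve` methods are hypothetical): the model proposes problems with checkable answers, its own solve rate on each is estimated, and only problems it solves some of the time are kept, since always-solved and never-solved problems both contribute no learning signal.

```python
def self_curriculum_step(model, n_candidates=100, attempts=8, lo=0.2, hi=0.8):
    """One round of a self-generated curriculum with verifiable rewards.

    `model` is assumed to expose two hypothetical methods:
      - propose_problem() -> (problem, verifier)  # verifier checks a solution
      - solve(problem)    -> candidate solution
    Only problems near the edge of the model's ability (solve rate between
    `lo` and `hi`) are kept: trivially solved problems teach nothing new,
    and never-solved ones (like open math conjectures) give zero reward
    gradient.
    """
    training_batch = []
    for _ in range(n_candidates):
        problem, verifier = model.propose_problem()
        solutions = [model.solve(problem) for _ in range(attempts)]
        rewards = [1.0 if verifier(s) else 0.0 for s in solutions]
        solve_rate = sum(rewards) / attempts
        if lo <= solve_rate <= hi:  # "edge of capability" filter
            training_batch.append((problem, solutions, rewards))
    return training_batch  # fed to the RL update, e.g. a policy-gradient step
```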