If you have to repeat the entire post-training all the way from a base model, that is obviously a lot more work than just adding a small fine-tuning stage to an already post-trained model.
The full post-training can also really only be done by a big lab that has its own full post-training stack. Post-training is getting more advanced and complicated every month.
Yeah, but it's plausible this cost is worth paying if the effect size is large enough (and there are various open-source instruction-tuning datasets that might reasonably recover e.g. Llama-3-Instruct).
Yeah, it could be worth it in some cases, if that is what you need for your experiment. In that case I would look for a completely open-source LLM project (where both the code and the data are open), so that you know you are comparing apples to apples-with-your-additional-pretraining.
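For concreteness, here is a minimal sketch of the "recover an instruct model from a base checkpoint" idea: supervised fine-tuning of a base model on an open instruction dataset using Hugging Face TRL. The model/dataset names and hyperparameters are illustrative assumptions, not a recipe anyone in this thread is claiming matches Llama-3-Instruct.

```python
# Hedged sketch: SFT of a base model on an open instruction-tuning dataset
# with TRL. Names and hyperparameters are illustrative only.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# An open chat-format ("messages" column) instruction dataset; any similar
# open dataset could be swapped in.
train_ds = load_dataset("HuggingFaceH4/ultrachat_200k", split="train_sft")

trainer = SFTTrainer(
    model="meta-llama/Meta-Llama-3-8B",  # base checkpoint, not the instruct release;
                                         # its tokenizer may need a chat template set
                                         # before training on chat-format data
    train_dataset=train_ds,
    args=SFTConfig(
        output_dir="llama3-8b-open-sft",
        num_train_epochs=1,
        per_device_train_batch_size=2,
        gradient_accumulation_steps=8,
        learning_rate=2e-5,
        bf16=True,
    ),
)
trainer.train()
```

Note that this only covers the SFT stage; a full post-training stack (preference tuning, RLHF, safety training, etc.) is exactly the part that is hard to reproduce outside a big lab, which is the point being debated above.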