Yeah, but it's plausible this cost is worth paying if the effect size is large enough (and there are various open-source instruction-tuning datasets which might reasonably recover something like Llama-3-Instruct)
Yeah, it could be worth it in some cases, if that's what your experiment needs. In that case I'd look for a fully open-source LLM project (where both the code and the data are open), so that you know you're comparing apples to apples-with-your-additional-pretraining.
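For what it's worth, a minimal sketch of what "redoing the instruction tuning on top of your continued-pretraining checkpoint" might look like, using an open SFT mixture from the Hugging Face hub. The specific dataset and model names are illustrative assumptions, not a claim that they reproduce Llama-3-Instruct:

```python
# Sketch: format an open instruction-tuning dataset for SFT on top of a
# continued-pretraining checkpoint. Dataset/model choices are placeholders.
from datasets import load_dataset
from transformers import AutoTokenizer

# Assumed: a tokenizer that ships a chat template (the Instruct tokenizer does).
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

# One example of an openly licensed instruction-tuning mixture.
ds = load_dataset("allenai/tulu-v2-sft-mixture", split="train")

def to_text(example):
    # Rows store a list of {"role", "content"} messages; render them into a
    # single training string with the tokenizer's chat template.
    return {"text": tokenizer.apply_chat_template(example["messages"], tokenize=False)}

sft_ds = ds.map(to_text, remove_columns=ds.column_names)
# ...then run standard supervised fine-tuning (e.g. trl's SFTTrainer) on sft_ds,
# once for the baseline checkpoint and once for the additionally-pretrained one,
# so the comparison really is apples to apples.
```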