And yet, current LLMs have noticeably different personas from each other, as well as coding skills that significantly outstrip what you would expect from imitation of the corpus. So their post-training has a large impact.
Pre-training forms the foundation, giving the model common sense and general abilities (LeCun: “Self-supervised learning: The dark matter of intelligence”; tailcalled: “At its most basic, unsupervised prediction forms a good foundation for later specializing the map to perform specific types of prediction”), while reinforcement learning adds something like goal orientation on top.
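To make the division concrete (a minimal sketch in standard notation, not drawn from either source): pre-training minimizes next-token cross-entropy over a corpus distribution $\mathcal{D}$, while RL post-training maximizes expected reward over the model's own sampled outputs,

$$\mathcal{L}_{\text{pre}}(\theta) = -\,\mathbb{E}_{x \sim \mathcal{D}} \sum_{t} \log p_\theta(x_t \mid x_{<t}), \qquad J_{\text{RL}}(\theta) = \mathbb{E}_{y \sim \pi_\theta(\cdot \mid x)}\big[\, r(x, y) \,\big].$$

The first objective can at best reproduce the corpus distribution; the second optimizes against a goal signal $r$, which is one way to see how post-training can push skills like coding past what imitation alone would predict.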