Yeah, until recently I thought the same thing, based on my belief that distilling a teacher model that has been trained by RL into a student model preserves not just the distribution over outputs but also, for the most part, the mechanisms behind those outputs. Which, as far as I can tell, was an incorrect belief.