Oliver Daniels comments on Why does off-model SFT degrade capabilities?