Jack Sanderson comments on Gradient routing is better than pretraining filtering