Caleb Biddulph comments on Vestigial reasoning in RL