Neel Nanda comments on Improving SAE’s by Sqrt()-ing L1 & Removing Lowest Activating Features