adamShimi comments on [AN #125]: Neural network scaling laws across multiple modalities