gwern comments on Proposal: Scaling laws for RL generalization