Haha, I guess it’s not quite a rigorous experiment. I was originally intending to measure changes in performance on Putnam practice tests, although that could be susceptible to slight Goodharting.