Kurt H. Pieper comments on Sanity-checking “Incompressible Knowledge Probes”

Kurt H. Pieper 3 May 2026 18:14 UTC
1 point
0
Possibly the method will underestimate parameter count as time goes on. I don’t expect it to be economically valuable to pretrain on the very long tails of knowledge, as opposed to letting more bits flow in from synthetic data / RLVR. Though I’m surprised as to why this hasn’t already happened.