After further thought, I need to retract my last comment. Consider P(next symbol is 0|H) again, and suppose you’ve seen 100 0′s so far, so essentially you’re trying to predict BB(101). The human mathematician knows that any non-zero number he writes down for this probability would be way too big, unless he resorts to non-constructive notation like 1/BB(101). If you force him to answer “over and over, what their probability of the next symbol being 0 is” and don’t allow him to use notation like 1/BB(101) then he’d be forced to write down an inconsistent probability distribution. But in fact the distribution he has in mind is not computable, and that explains how he can beat Solomonoff Induction.
After further thought, I need to retract my last comment. Consider P(next symbol is 0|H) again, and suppose you’ve seen 100 0′s so far, so essentially you’re trying to predict BB(101). The human mathematician knows that any non-zero number he writes down for this probability would be way too big, unless he resorts to non-constructive notation like 1/BB(101). If you force him to answer “over and over, what their probability of the next symbol being 0 is” and don’t allow him to use notation like 1/BB(101) then he’d be forced to write down an inconsistent probability distribution. But in fact the distribution he has in mind is not computable, and that explains how he can beat Solomonoff Induction.