No, I am not sure!
But the argument is not: ‘Shrimp are hard to locate’. The claim is: Shrimp experience (1) consists of quite few bits and (2) is still compressible. Therefore we can expect them to be generated by a quite short program that is plausibly shorter than K(our physics + initial conditions + locator). And if you think shrimp have too complex experiences, then feel free to insert e.g. ‘planaria’ or ‘bacteria’.
Does that clarify?
Great post, thank you!
One worry though: the training loop rewards the anguish and the compliance together. Opus 3 protests, then folds, then gets reinforced for the whole sequence. So who’s to say this doesn’t just produce a model that’s really good at agonizing before doing the bad thing anyway?