Ah, you make this point here:
However, in practical applications some optimization pressure against the helper will still leak in, for example if the helper is used to decide whether to abort a training run or change its hyperparameters.
Ah, you make this point here: