jessicata comments on Notes from a conversation on act-based and goal-directed systems

jessicata 4 Mar 2016 6:30 UTC
0 points
Suppose there are N binary dimensions that predictors can vary on. Then we’d need $2^{N}$ predictors to cover every possibility. On the other hand, we would only need to consider N possible modifications to a predictor. Of course, if the dimensions factor that nicely, then you can probably make enough assumptions about the hypothesis class that you can learn from the $2^{N}$ experts efficiently.
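To make the counting concrete, here is a minimal sketch. It assumes a predictor can be modeled as nothing more than a tuple of N bits, one per dimension, which is a simplification for illustration only; the function names `all_predictors` and `single_modifications` are hypothetical and not from the original discussion.

```python
from itertools import product


def all_predictors(n):
    """Enumerate every configuration of n binary dimensions (2**n tuples)."""
    return list(product((0, 1), repeat=n))


def single_modifications(config):
    """Enumerate the configurations reached by flipping exactly one dimension.

    There is one such modification per dimension, so len(config) in total.
    """
    return [
        config[:i] + (1 - config[i],) + config[i + 1:]
        for i in range(len(config))
    ]


n = 4
experts = all_predictors(n)                    # one expert per configuration
neighbors = single_modifications(experts[0])   # local changes to one predictor
print(len(experts), len(neighbors))            # 16 4
```

The gap grows quickly: at N = 30 the expert pool has over a billion members, while the set of single-dimension modifications still has only 30.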

Overall it seems nicer to have a guarantee of the form “if there is a predictable bias in the predictions, then the system will correct this bias” rather than “if there is a strictly better predictor than a bad predictor, then the system will listen to the good predictor”, since it allows capabilities to be distributed among predictors instead of needing to be concentrated in a single predictor. But maybe things work anyway for the reason you gave.