Is 80% the highest success rate you can practically test?
UPD Thomas essentially answered elsewhere:
20% and 80% time horizons are kind of fake because there aren’t enough parameters to fit them separately.We fit a two-parameter logistic model which doesn’t fit the top and bottom of the success curve separately, so improving performance on 20% horizon tasks can lower 80% horizon.
20% and 80% time horizons are kind of fake because there aren’t enough parameters to fit them separately.
We fit a two-parameter logistic model which doesn’t fit the top and bottom of the success curve separately, so improving performance on 20% horizon tasks can lower 80% horizon.
Is 80% the highest success rate you can practically test?
UPD Thomas essentially answered elsewhere: