I wonder whether the result depends on the type of OOD.
If you are OOD by having less extractable information, then the results are intuitive. If you are OOD by having extreme extractable information or misleading information, then the results are unexpected.
Oh, I just read their Appendix A, “Instances Where ‘Reversion to the OCS’ Does Not Hold.” Outputting the average prediction is indeed not the only OOD behavior, so it seems there are different types of OOD regimes.
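To make the distinction between the two kinds of OOD concrete, here is a toy sketch. It is not the paper's setup, just a 1-D least-squares line, and the `scale` knob is my own hypothetical stand-in: shrinking the input toward its training mean plays the role of "less extractable information", and blowing it up plays the role of "extreme information". Under squared error the best constant prediction is the training mean of `y`, so that is the constant the weak-signal case should drift toward.

```python
import random

def fit_line(xs, ys):
    """Ordinary least squares for y = a * (x - mean_x) + mean_y in one dimension."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var, mean_x, mean_y

# Synthetic training data (an assumption for illustration only).
rng = random.Random(0)
xs = [rng.gauss(0.0, 1.0) for _ in range(1000)]
ys = [2.0 * x + 3.0 + rng.gauss(0.0, 0.1) for x in xs]
a, mean_x, mean_y = fit_line(xs, ys)

def predict(x, scale):
    """Predict after scaling the input's deviation from the training mean.

    scale < 1 mimics an input carrying less signal than training inputs;
    scale > 1 mimics an input carrying an extreme amount of signal.
    """
    return a * scale * (x - mean_x) + mean_y

x_test = 1.5
weak = predict(x_test, 0.01)      # almost no signal left in the input
extreme = predict(x_test, 100.0)  # signal far outside the training range

print("training mean of y:", mean_y)
print("weak-signal prediction:", weak)
print("extreme-signal prediction:", extreme)
```

In this toy, the weak-signal prediction sits close to the training mean, while the extreme-signal prediction extrapolates far away from it. A linear model is of course much simpler than a neural network, so this only illustrates why the two regimes could behave differently, not what the paper's models actually do.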