Charbel-Raphaël comments on Paper: Teaching GPT3 to express uncertainty in words

Charbel-Raphaël 31 May 2022 22:19 UTC
1 point
Interestingly, fine-tuning does better than the other methods on Multi-answer, but not that well on Multiply-divide. Given the training task, I would have forecast the opposite. For example, on Multiply-divide the model could have just guessed the probability by looking at the number of digits involved in the operation.
Hmm, I do not understand why the results came out this way.
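
To make the digit-count idea concrete, here is a minimal sketch of what such a baseline might look like. It is not from the paper: the function names (`digit_count`, `fit_digit_baseline`, `predict_confidence`) are hypothetical, and the `toy` records are made up purely for illustration. The assumption is that we have arithmetic questions with operands `a` and `b`, plus a flag for whether the model answered correctly.

```python
from collections import defaultdict


def digit_count(a: int, b: int) -> int:
    """Total number of digits across both operands (hypothetical feature)."""
    return len(str(abs(a))) + len(str(abs(b)))


def fit_digit_baseline(examples):
    """Map each digit count to the empirical accuracy observed for it.

    `examples` is an iterable of (a, b, was_correct) tuples.
    """
    totals = defaultdict(int)
    hits = defaultdict(int)
    for a, b, was_correct in examples:
        k = digit_count(a, b)
        totals[k] += 1
        hits[k] += int(was_correct)
    return {k: hits[k] / totals[k] for k in totals}


def predict_confidence(table, a: int, b: int, default: float = 0.5) -> float:
    """Stated confidence = past accuracy on questions with the same digit count."""
    return table.get(digit_count(a, b), default)


# Made-up records for illustration only, not results from the paper.
toy = [
    (3, 4, True),
    (7, 2, True),
    (12, 5, True),
    (48, 6, False),
    (123, 45, False),
    (671, 32, False),
]
table = fit_digit_baseline(toy)
```

With this toy data, `predict_confidence(table, 9, 9)` falls in the two-digit bucket, while an unseen digit count such as `predict_confidence(table, 1000, 1000)` falls back to `default`. A baseline like this only uses surface features of the question, which is why I expected fine-tuning to pick it up easily.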