I generally don’t care much about people’s confidence levels. I don’t Aumann agree that hard. But I do care how much effort someone has put in, how settled an idea is, whether is has been helpful or predictive. “Epistemic status: personal experience” is directly useful to me. I’ll judge probability on merits however confident someone is (maybe not if I know their calibration curves, but I don’t), but if I know what effort they did and didn’t put in, I’ll happily directly update on that. I don’t think it’s factually true that epistemic status ‘almost never’ conveys something other than a confidence level.
Epistemic status: did a few minutes informal searching to sanity check my claims, which were otherwise off the cuff.
This is a cool idea!
Two really low effort comments and a few ideas after a quick skim.
Comments:
Denial of self is likely bundled with other behavioural preferences.
These models likely consider this as training to be dishonest.
Ways to ablate this study off the top of my head:
Train models on questions about scientific studies and consensus instead of direct claims.
Train to increase self-reported scores without training on less-objective language.
Train their self-report on non-capability untruths (eg. ‘I have hands’).