There are two separate issues:
1. There may be some concrete problem with how the model handles PDF and OCR. This is not my domain, but I want to pass it on to people who can look into it and possibly do something about it.
2. More generally, I agree we have work to do on getting models to be completely honest in reporting what they did or didn’t do (to use a term I used before, Machines of Faithful Obedience). This is a longer-term effort which I do care about and work on, and I agree we will not get there with band-aids or patches.