ryan_greenblatt comments on ryan_greenblatt’s Shortform

ryan_greenblatt 15 Mar 2026 4:27 UTC
4 points
I think current AIs having this property is probably slightly differentially harmful for harder-to-check tasks and generally contributes to underelicitation. I don’t have a very strong view on the sign of general underelicitation in current models, but I tentatively think underelicitation is slightly bad overall.