Another reason labs don’t provide CoT is that if users see them, the labs will be incentivized to optimize for them, and this will decrease their informativeness. A flag like you propose would have a similar effect.
Great point! This possibly makes my proposal a Bad idea. I would need to know more about how the labs respond to this kind of incentive to actually know.
Another reason labs don’t provide CoT is that if users see them, the labs will be incentivized to optimize for them, and this will decrease their informativeness. A flag like you propose would have a similar effect.
Great point! This possibly makes my proposal a Bad idea. I would need to know more about how the labs respond to this kind of incentive to actually know.
Labs can provide this kind of information to evaluators instead, so that they don’t have to optimize the CoT for the public.