ryan_greenblatt comments on Value learning in the absence of ground truth