Roman Leventov comments on Value learning in the absence of ground truth