janos comments on Problems with learning values from observation