Stuart_Armstrong comments on Proper value learning through indifference