Olli Järviniemi comments on Reward is not the optimization target