Gunnar_Zarncke comments on Towards deconfusing wireheading and reward maximization