Kei Nishimura-Gasparian comments on Reward hacking behavior can generalize across tasks