Rauno Arike comments on (Some) Natural Emergent Misalignment from Reward Hacking in Non-Production RL