tchauvin comments on Reinforcement Learning Goal Misgeneralization: Can we guess what kind of goals are selected by default?