Task restriction. The observation that diverse environments seem to increase the probability of mesa-optimization
Where does this observation come from?