Very interesting! This research direction could give researchers better intuitions about which sorts of mesa-objectives we’re more likely to end up with.
Perhaps similar experiments could be done with supervised learning (instead of RL).