Ofer comments on Towards an empirical investigation of inner alignment

Ofer 24 Sep 2019 4:57 UTC
LW: 2 AF: 2
AF
Very interesting! This research direction might lead to researchers having better intuitions about what sort of mesa-objectives we’re more likely to end up with.
Perhaps similar experiments can be done with supervised learning (instead of RL).