John_Maxwell comments on Why GPT wants to mesa-optimize & how we might change this

John_Maxwell 22 Sep 2020 11:38 UTC
2 points
My thought was that if lookahead improves performance during some period of the training, it’s liable to develop mesa-optimization during that period, and then find it to be a useful for other things later on.