David Scott Krueger comments on Just Imitate Humans?

David Scott Krueger 6 Aug 2019 2:32 UTC
I don’t know if it has come up in the comments, but to me, naive approaches (e.g. ones not informed by cognitive architecture) seem fairly likely (~40%, off the top of my head) to produce mesa-optimizer-like things. See: https://www.lesswrong.com/posts/whRPLBZNQm3JD5Zv8/imitation-learning-considered-unsafe
Otherwise, yes, this seems great, especially if we just imitate AI safety researchers and let them go on to solve all the safety problems.