Stuart_Armstrong comments on Benchmark for successful concept extrapolation/avoiding goal misgeneralization