AdamGleave comments on Benchmark for successful concept extrapolation/​avoiding goal misgeneralization