I liked the point about “the reason GPT3 isn’t consequentialist is that it doesn’t find it’s way to the same configuration when you perturb the starting conditions.” I think I could have generated that definition of consequentialism, but would have trouble making the connection on-the-fly. (At least, I didn’t successfully generate it in between reading Scott’s confusion and Eliezer’s explanation).
I liked the point about “the reason GPT3 isn’t consequentialist is that it doesn’t find it’s way to the same configuration when you perturb the starting conditions.” I think I could have generated that definition of consequentialism, but would have trouble making the connection on-the-fly. (At least, I didn’t successfully generate it in between reading Scott’s confusion and Eliezer’s explanation).
I feel like I now get it more crisply.