I think this is one reasonable interpretation of his comments. But the fact that he:
1. Didn’t say very much about a solution to the problem of making models want to follow our ethical principles, and 2. Mostly talked about model capabilities even when explicitly asked about that problem
makes me think it’s not something he spends much time thinking about, and is something he doesn’t think is especially important to focus on.
I think this is one reasonable interpretation of his comments. But the fact that he:
1. Didn’t say very much about a solution to the problem of making models want to follow our ethical principles, and
2. Mostly talked about model capabilities even when explicitly asked about that problem
makes me think it’s not something he spends much time thinking about, and is something he doesn’t think is especially important to focus on.