Kei comments on Shane Legg interview on alignment

Kei 29 Oct 2023 22:02 UTC
7 points
3
I think this is one reasonable interpretation of his comments. But the fact that he:

1. Didn’t say very much about a solution to the problem of making models want to follow our ethical principles, and
2. Mostly talked about model capabilities even when explicitly asked about that problem

makes me think it’s not something he spends much time thinking about, and is something he doesn’t think is especially important to focus on.