Hmm, of the faculty Stuart spends the most time thinking about AI alignment, I’m not sure how much the other faculty have thought about corrigibility—they’ll have views about the off switch game, but not about MIRI-style corrigibility.
Most of the staff doesn’t work on technical research, so they probably won’t have strong opinions. Exceptions: Critch and Karthika (though I don’t think Karthika has engaged much with corrigibility).
Probably the best way is to find emails of individual researchers online and email them directly. I’ve also left a message on our Slack linking to this discussion.
Thanks. Two questions:
Do the staff and faculty have a similair diversity of opinions?
Is messaging chai-info@berkeley.edu in orde to contact your peers the right procedure here?
Hmm, of the faculty Stuart spends the most time thinking about AI alignment, I’m not sure how much the other faculty have thought about corrigibility—they’ll have views about the off switch game, but not about MIRI-style corrigibility.
Most of the staff doesn’t work on technical research, so they probably won’t have strong opinions. Exceptions: Critch and Karthika (though I don’t think Karthika has engaged much with corrigibility).
Probably the best way is to find emails of individual researchers online and email them directly. I’ve also left a message on our Slack linking to this discussion.