Introduction to Corrigibility

Rubi Hudson will discuss “corrigibility”, the property of an AI being willing to accept updates to its goal. The talk covers why corrigibility matters for AI safety, the challenges to achieving it, and some promising new work on the subject.

Registration Instructions
This is a paid event ($5 general admission, free for students and job seekers) with limited tickets. You must RSVP on Luma to secure your spot.

Event Schedule
6:00 to 6:30 - Food and introductions
6:30 to 7:30 - Presentation and Q&A
7:30 to 9:00 - Open Discussions

If you can’t attend in person, join our live stream starting at 6:30 pm via this link.

This is part of our weekly AI Safety Thursdays series. Join us in examining questions like:

  • How do we ensure AI systems are aligned with human interests?

  • How do we measure and mitigate potential risks from advanced AI systems?

  • What does safer AI development look like?
