Дмитрий Зеленский comments on 2. Corrigibility Intuition

Дмитрий Зеленский 10 May 2026 21:51 UTC
1 point
−1
In a literal sense, a corrigible agent is one that can be corrected. But in the context of AI alignment, I believe that the word should mean something stronger than mere correctability.
Can I submit that “correctable” is language misuse? :)