In a literal sense, a corrigible agent is one that can be corrected. But in the context of AI alignment, I believe that the word should mean something stronger than mere correctability.
Can I submit that “correctable” is language misuse? :)