Seems to be missing old stuff by Stuart Armstrong (?)
Armstrong is one of the authors on the 2015 Corrigibility paper, which I address under the Yudkowsky section (sorry, Stuart!). I also have three of his old essays listed in the 0th essay in this sequence:
“The limits of corrigibility.” 2018.
“Petrov corrigibility.” 2018.
“Corrigibility doesn’t always have a good action to take.” 2018.
While I did read these as part of writing this sequence, I didn’t feel like they were central/foundational/evergreen enough to warrant a full response. If there’s something Armstrong wrote that I’m missing or a particular idea of his that you’d like my take on, please let me know! :)