I think the 2021 MIRI Conversations and 2022 MIRI Alignment Discussion sequences are an attempt at this. I feel like I have a relatively good handle on their frame after reading those sequences, and I think the ideas contained within are pretty insightful.
Like Zvi, I might be confused about how confused I am, but I don’t think it’s because they’re trying to keep their views secret. Maybe there’s some more specific capabilities-adjacent stuff they’re not sharing, but I suspect the thing the grandparent is getting at is more about a communication difficulty that in practice seems to be overcome mostly by working together directly, as opposed to the interpretation that they’re deliberately not communicating their basic views for secrecy reasons.
(I also found Eliezer’s fiction helpful for internalizing his worldview in general, and IMO it is also has some pretty unique insights.)
I think the 2021 MIRI Conversations and 2022 MIRI Alignment Discussion sequences are an attempt at this. I feel like I have a relatively good handle on their frame after reading those sequences, and I think the ideas contained within are pretty insightful.
Like Zvi, I might be confused about how confused I am, but I don’t think it’s because they’re trying to keep their views secret. Maybe there’s some more specific capabilities-adjacent stuff they’re not sharing, but I suspect the thing the grandparent is getting at is more about a communication difficulty that in practice seems to be overcome mostly by working together directly, as opposed to the interpretation that they’re deliberately not communicating their basic views for secrecy reasons.
(I also found Eliezer’s fiction helpful for internalizing his worldview in general, and IMO it is also has some pretty unique insights.)