When it comes to disclosure policies, if I’m uncertain between the “MIRI view” and the “Paul Christiano view”, should I bite the bullet and back one approach over the other? Or can I aim to support both, without worrying that they defeat each other?
My current understanding is that it’s coherent to support both at once. That is, I can think it possible that intelligence needs lots of fundamental insights, and that safety needs lots of similar insights (this is meant as a characterisation of a MIRI-ish view). On that view, I can think that work on understanding intelligence and how to control it should only be shared cautiously, because it may accelerate the creation of AGI.
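I can also think that prosaic AGI is possible, and that fundamental insights aren’t needed. Then I might think that I could do research that would help align prosaic AGIs but couldn’t possibly align (or contribute to) an agent-based AGI. Such research could presumably be shared more freely, since it wouldn’t accelerate the kind of AGI the first view worries about.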
Is the above consistent? Also, would people who worry about disclosure (or anyone with a better emulator of those people than mine) think that this makes sense from their point of view?