I just re-read EY’s CEV paper and noticed that I had forgotten quite a bit since the last time I read it. He goes over most of the things I whined about. My lingering complaint/worry is that human desires won’t converge, but so long as CEV just says “fail” in that case instead of “become X maximizers” we can potentially start over with individual or smaller-group CEV. A thought experiment I have in mind is what would happen if more than one group of humans independently invented FAI at the same time. Would the FAIs merge, cooperate, or fight?
I guess I am also not quite sure how FAI will actively prevent other AI projects or whole brain simulations or other FOOMable things, or if that’s even the point. I guess it may be up to humans to ask the FAI how to prevent existential risks and then implement the solutions themselves.
I just re-read EY’s CEV paper and noticed that I had forgotten quite a bit since the last time I read it. He goes over most of the things I whined about. My lingering complaint/worry is that human desires won’t converge, but so long as CEV just says “fail” in that case instead of “become X maximizers” we can potentially start over with individual or smaller-group CEV. A thought experiment I have in mind is what would happen if more than one group of humans independently invented FAI at the same time. Would the FAIs merge, cooperate, or fight?
I guess I am also not quite sure how FAI will actively prevent other AI projects or whole brain simulations or other FOOMable things, or if that’s even the point. I guess it may be up to humans to ask the FAI how to prevent existential risks and then implement the solutions themselves.