Topic: AI strategy (policies, malicious use of AI, AGI misalignment)
Epistemic status: simplistic; simplified line of reasoning; thinking out loud; a proposed frame
A significant “warning shot” from a sovereign misaligned AI doesn’t seem likely to me, because a human-level (and plausibly even a subhuman-level) intelligence can 1) learn deception, yet 2) generally can’t do much damage (i.e. damage perceptible to humanity). So the last “warning shot” before an AI learns deception won’t be very big (if it’s even notable at all); after that, a misaligned agent would hide its power and/or intentions until it’s confident it can overpower humanity (because hiding is an easy way to keep gaining power), at which point it would cause an omnicide. An exception: if an AI believes other AIs are hiding in the world, it might accept a higher risk and try to overpower humanity before it’s confident it can succeed, out of concern that another AI will do so first. I’m not very hopeful this would give us a good warning shot, though, because multiple such AIs trying to overpower humanity would likely be too damaging for us to regroup in time.
However, it seems much more plausible to me that (non-agentic) AI tools will be used maliciously, which could lead governments to heavily regulate AI. Regulations that prevent malicious uses (ex.: nationalizing AI) could also potentially help with negligent uses. Assuming that a negligent use (i.e. one resulting in a misaligned AGI) is much more likely to cause an existential catastrophe than a malicious use, and that regulations against malicious uses are more memetically fit, the ideal regulations to advocate for might be those that are good at preventing both malicious uses and the negligent creation of a misaligned AGI.