This is well-reasoned, but I have difficulty understanding why this kind of takeover would be necessary from the perspective of a powerful, rational agent. Assuming AGI is indeed worth its name, it seems the period of time needed for it to “play nice” would be very brief.
AGI would be expected to be totally unconcerned with being “clean” in a takeover attempt: no need to avoid leaving witnesses, no need to avoid rousing opposition. Once it has access to sufficient compute and enough control over physical resources, why wait 10 years for humanity to be slowly, obliviously strangled?
You say there’s “no need” for it to reveal that we are in conflict, but in many cases, concealing a conflict will prevent a wide range of critical, direct moves. The default is a blatant approach—concealing a takeover requires more effort and more time.
The nano-factories scenario is a rather extreme version of this, but strategies like poisoning the air/water, building or stealing an army of drones, launching hundreds of nukes, etc., all seem like much more straightforward ways to cripple opposition, even for a relatively weak (99.99th-percentile-human-level) AGI.
It could certainly angle for humanity to go out with a whimper, not a bang. But if a bang is quicker, why bother with the charade?
It bothers with the charade until it no longer needs to. It’s unclear how long that’ll take.
What happens if there is more than one powerful agent playing the charade game? Is there any good article about what happens in a universe where multiple AGIs are competing among themselves? I usually find only texts that assume that once we get AGI we all die, so there is no room for these scenarios.
Coincidentally, I’ve just made a post on that very topic. Though the comments fairly point out my analysis might’ve been somewhat misaimed there.
You might find this post by Andrew Critch, or this post and that one by Paul Christiano, more to your liking.