PotteryBarn comments on All AGI Safety questions welcome (especially basic ones) [~monthly thread]

PotteryBarn 29 Jan 2023 3:35 UTC
1 point
0
Thank you for making these threads. I have been reading LW off and on for several years and this will be my first post.
My question: Is purposely leaning into creating a human wire-header an easier alignment target to hit than the more commonly touted goal of creating an aligned superintelligence that prevents the emergence of other potentially dangerous superintelligence, yet somehow reliably leaves humanity mostly in the driver’s seat?
If the current forecast on aligning superintelligent AI is so dire, is there a point where it would make sense to just settle for ceding control and steering towards creating a superintelligence very likely to engage in wire heading humans (or post-humans)? I’m imagining the AI being tasked with tiling the universe with as many conscious entities as possible, with each experiencing as much pleasure as possible, and maybe with a bias towards maximizing pleasure over number of conscious entities as those goals constrain each other. I don’t want to handwave this as being easy, I’m just curious if there’s been much though to removing the constraint of “don’t wirehead humans.”
Background on me: One of my main departures from what I’ve come across so far is that I don’t share as much concern about hedonism or wire heading. It seems to me a superintelligence would grasp that humans in their current form require things like novelty, purpose, and belonging and wouldn’t just naively pump humans—as they currently are—full of drugs and call it a day. I don’t see why these “higher values” couldn’t be simulated or stimulated too. If I learned my current life was a simulation, but one that was being managed by an incredibly successful AI that could reliably keep the simulation running unperturbed, I would not want to exit my current life in order to seize more control in the “real world.”
Honestly, if an AI could replace humans with verifiably conscious simple entities engineered to experience greater satisfaction than current humans and without ever feeling boredom, I’m hard pressed to thing of anything more deserving of the universe’s resources.
The main concern I have with this so far is that the goal of “maximizing pleasure” could be very close in idea-space to “maximizing suffering,” but it’s still really hard for me to see a superintelligence cable of becoming a singleton making such an error or why it would deliberately switch to maximizing suffering.
- mruwnik 1 Feb 2023 11:23 UTC
  1 point
  0
  Parent
  It would depend a lot on the implementation details. Also on whether it’s possible. The basic wireheader seems simpler, in that it doesn’t need to factor in humanities stupid ideas.
  Adding in extra details, like that it has to pretty much make a virtual heaven for people, makes it a lot harder than just pumping them with morphine.
  maximizing pleasure is very close in idea space to max suffering, because both do pretty much the same, but with opposite signs (yes, this is a travesty of an oversimplification, but you get the idea). The most common raised problems are mainly Goodhart issues. How would you specify that the AI maximise pleasure?
  You seem to have a slightly idiosyncratic interpretation of wireheading, in that it’s usually described as blissing out on drugs, rather than being simulated in an enriching and fun environment. A lot more people would be fine with the second option than would agree to the first one.
  As to the simple, conscious blissers, they seem empty to me. I expect this is a difference in basic morality/worldview though, in that I don’t find hedonism attractive. And boredom is very useful.