koanchuk comments on Knight Lee’s Shortform

koanchuk 30 Jan 2026 0:58 UTC
3 points
0
A deal implies that you have something to offer to the ASI, which you define as powerful enough to take over the universe. What is that?
- Knight Lee 30 Jan 2026 5:09 UTC
  2 points
  0
  Parent
  One “deal with the devil” is to assume that the misaligned ASI will have a tiny amount of kindness and won’t kill everyone by default. This view is pretty popular, e.g. see Notes on fatalities from AI takeover. Assuming that a misaligned ASI will be survivable means potentially prioritizing it less, and focusing instead on making sure China or “bad” humans don’t win, and on all the other issues. This technically isn’t a deal, but it is part of what I’m talking about.
  Notes on fatalities from AI takeover cites comment, comment and You can, in fact, bamboozle an unaligned AI into sparing your life by David Matolcsi. Matolcsi’s post is an idea for making deals with the ASI.
  I actually agree with the trade idea in Matolcsi’s post.
  I especially agree with this part:
  “We could have enough control over our simulation and the AI inside it, that when it tries to calculate the probability of humans solving alignment, we could tamper with its thinking to make it believe the probability of humans succeeding is very low. Thus, if it comes to believe in our world that the probability that the humans could have solved alignment is very low, it can’t really trust its calculations.”
  I like this part because it’s an acausal trade between counterfactual futures rather than an acausal trade between different parts of the multiverse within the same future.
  This means the trade works even in the worst counterfactual where $\approx 0\%$ of civilizations in the entire multiverse managed to solve alignment.
  This type of acausal trade also genuinely benefits from commitment or action now, rather than being something we can wait until after the singularity to worry about, because such a trade might become impossible once we ourselves learn the true frequency of civilizations solving alignment. You can’t buy insurance on a risk after learning whether or not it happened (maybe).
  But I disagree with his opinion that:
  Nate and Eliezer are known to go around telling people that their children are going to be killed by AIs with 90+% probability. If this objection about future civilizations not paying enough is their real objection, they should add a caveat that “Btw, we could significantly decrease the probability of your children being killed, by committing to use one-billionth of our resources in the far future for paying some simulated AIs, but we don’t want to make such commitments, because we want to keep our options open in case we can produce more Fun by using those resources for something different than saving your children”.
  I disagree because it’s not enough to just get people living in base reality to survive the singularity and have a happy future. You still die unless there is a happy future for everyone, real or simulated.
  - koanchuk 30 Jan 2026 7:38 UTC
    3 points
    0
    Parent
    Matolcsi’s post is an idea for making deals with the ASI.
    I notice that his proposal shares some basic characteristics with religion. You should believe that this world is a test: follow these rules, and you go to heaven; misbehave, and you go to hell (or in this case, a softhearted re-imagination of hell). Indeed, it does work on people, sometimes.
    I imagine Actually Something Incomprehensible noticing the double irony of inverting the classic mantra “God says, I shall be good” into “Singularity, thou shalt be good”, combined with the fact that you refer to it as the devil. Who knows what it does with this information?
    I know what I’ll say if I ever get arrested: Let me be, set me free, or super-me will screw with thee!
    - Knight Lee 30 Jan 2026 8:27 UTC
      3 points
      0
      Parent
      Religion does work sometimes. It actually worked on Blaise Pascal, who is among the most intelligent people of all time. He argued for Pascal’s wager, saying that following religion is worth it because the gains are infinite and the costs are finite, and we still don’t have a good reply to that. We don’t even have a good reply to Pascal’s mugging, where a random mugger says something like “Let me be, set me free, or super-me will screw with thee!” with an infinitely big promise or threat.
      Decision theory and acausal trade are really complicated, and I have no idea what the ASI will actually do or think regarding the simulation promise/threat. It’s quite freaky imagining that haha.
      - koanchuk 30 Jan 2026 9:02 UTC
        2 points
        −1
        Parent
        Memetically, a religion certainly benefits from someone believing that accepting Pascal’s wager is the correct decision. My reply to it would be “which religion?”, since many make largely equivalent claims while also demanding exclusivity, and I assume that God in his infinite mercy understands the bind this puts people in. It also seems to me that accepting Pascal’s wager leads to something like the simulation of belief.
        Knight Lee 30 Jan 2026 9:38 UTC
        2 points
        0
        Parent
        I agree the “which religion,” “which mugger” question is very fuzzy. I didn’t understand the simulation of belief or the link though :/
        koanchuk 30 Jan 2026 19:12 UTC
        1 point
        0
        Parent
        What I meant was that there seems to be a difference between “genuine” belief and converting as a result of accepting Pascal’s wager, which seems like a simulation of belief.
        The link is a koan; the idea of pretend-believing reminded me of the boy in it.