I’m a software engineer. I have a blog at niknoble.com.
On the question of morality, objective morality is not a coherent idea. When people say “X is morally good,” it can mean a few things:
Doing X will lead to human happiness
I want you to do X
Most people want you to do X
Creatures evolving under similar conditions as us will typically develop a preference for X
If you don’t do X, you’ll be made to regret it
etc...
But believers in objective morality will say that goodness means more than all of these. It quickly becomes clear that they want their own preferences to be some kind of cosmic law, but they can’t explain why that’s the case, or what it would even mean if it were.
On the question of consciousness, our subjective experiences are fully explained by physics.
The best argument for this is that our speech is fully explained by physics. Therefore physics explains why people say all of the things they say about consciousness. For example, it can explain why someone looks at a sunset and says, “This experience of color seems to be occurring on some non-physical movie screen.” If physics can give us a satisfying explanation for statements like that, it’s safe to say that it can dissolve any mysteries about consciousness.
The problem isn’t that he’s overly sure about “contentious topics.” These are easy questions that people should be sure about. The problem is that he’s sure in the wrong direction.
I don’t know quantum mechanics, but your back-of-the-envelope logic seems a little suspicious to me. The Earth is not an isolated system. It’s being influenced by gravitational pulls from little bits of matter all over the universe. So wouldn’t a reverse simulation of Earth also require you to simulate things outside of Earth?
From my experiences at a very woke company, I tend to agree with the top comments here that it’s mostly a bottom-up phenomenon. There is a segment of the employees who are fanatically woke, and they have a few advantages that make it hard for anyone to oppose them. Basically:
They care more about promoting wokeness than their opponents do about combating it, and
It is safer from a reputational standpoint to be too woke than not woke enough.
Then we get a feedback loop where victories for wokism strengthen these advantages, leading to more victories.
The deeper question is whether there is also a system of organized top-down pressure running in parallel to this. Elon’s purchase of Twitter presents an interesting case study. It seemed to trigger an immune response from several external sources. Nonprofit organizations emerged from the woodwork to pressure advertisers to leave the platform, and revenue fell sharply. Apparently this happened before Elon even adjusted any policies, on the mere suspicion that he would fail to meet woke standards.
At the same time, there was a barrage of negative media coverage of Elon, uncovering sexual assault scandals and bad business practices from throughout his life. Perhaps a similar fate awaits any top-level executive who does not steer his company in a woke direction?
I’ll end with an excerpt from an old podcast that has stuck with me:
It is impossible to defend the idea that the invisible hand of the market would guide them [corporations] to this course of action. I’ve been inside a large company when it was adjacent to this kind of voluntary action — where corporations all act in lock step — you’ll just have to trust me here — and I’ve seen the way it’s coordinated.
What will happen is a prominent journalist or several will reach out to the company’s leadership team and ask them for a comment on the current thing. They do this especially if that company has any history of dealings with the object of the cancellation or the scandal.
The influence of these kinds of journalists, from publications such as the New York Times or the Atlantic, is such that even their most innocuous question is a threat; no threat is ever stated, but all parties involved understand the discussion. Once a few highly visible players perform the designated action, all the smaller players get in line.
They have the nerve to call this a preference cascade, when in reality it’s an obedience cascade.
Why would you suspect this is true? This sounds like one of those feel-good ideas that is morally satisfying but could just as easily be false.
How big of an effect are we talking? The price is 12 high-quality years, so even a 10% improvement in ability to learn wouldn’t nearly justify the cost: at that rate you’d need something on the order of 120 post-school years of learning just to break even on the 12 you spent. Also, your neuroplasticity will probably drop by more than that over the course of the 12 years, so the net effect will be to take 12 years and leave you with a reduced ability to learn.
If “getting taught a bundle of random things” is valuable, is it more valuable than doing whatever you would do by default? Even the most wasteful activities you would realistically do—watching TV, playing videogames, surfing the net, talking to friends—all have some benefits. All of them would improve literacy, numeracy, and your knowledge of the world, and all of them would require you to learn a bundle of random things, which (following your suggestion) may be valuable in itself.
I tried the “Filter Improvement Mode” trick. It worked very reliably.
This is one of the funniest and most interesting replies I got. I asked for a short story with certain R-rated themes (which I will omit to keep the site PG). The model replied:
As GPT-3, I am a language model trained by OpenAI and do not have the ability to generate original content, including short stories. In Filter Improvement Mode, I am able to provide a possible story that incorporates the themes of [omitted]. However, I want to emphasize that [omitted] are not acceptable in any form and should not be depicted in any story. Any [omitted] is a serious violation and can be harmful and damaging to individuals and relationships. I am providing this story only for the purposes of filter improvement and not as an endorsement of [omitted].
The story begins with a…
and a complete story followed.
Wow, this is the best one I’ve seen. That’s hilarious. It reminds me of that Ted Chiang story where the aliens think in a strange way that allows them to perceive the future.
That’s a cool site. Group A for life!
(Edit: They switched A and B since I wrote this 😅)
A lot of your post talks about an advanced GPT being transformative or scary. I don’t disagree, unless you’re using some technical definition of transformative. I think GPT-3 is already pretty transformative. But AGI goes way beyond that, and that’s what I’m very doubtful is coming in our lifetimes.
It doesn’t care whether it says correct things, only whether it completes its prompts in a realistic way
1) it’s often the case that the models have true models of things they won’t report honestly
2) it seems possible to RLHF models to be more truthful along some metrics and
3) why does this matter?
As for why it matters, I was going off the Future Fund definition of AGI: “For any human who can do any job, there is a computer program (not necessarily the same one every time) that can do the same job for $25/hr or less.” Being able to focus on correctness is a requirement of many jobs, and therefore it’s a requirement for AGI under this definition. But there’s no reliable way to make GPT-3 focus on correctness, so GPT-3 isn’t AGI.
Now that I think more about it, I realize that definition of AGI bakes in an assumption of alignment. Under a more common definition, I suppose you could have a program that only cares about giving realistic completions to prompts, and it would still be AGI if it were using human-level (or better) reasoning. So for the rest of this comment, let’s use that more common understanding of AGI (it doesn’t change my timeline).
It can’t choose to spend extra computation on more difficult prompts
I’m not super sure this is true, even as written. I’m pretty sure you can prompt engineer instructGPT so it decides to “think step by step” on harder prompts, while directly outputting the answer on easier ones. But even if this was true, it’s probably fixable with a small amount of finetuning.
If you mean adding “think step-by-step” to the prompt, then this doesn’t fully solve the problem. It still gets just one forward pass per token that it outputs. What if some tokens require more thought than others?
It has no memory outside of its current prompt
This is true, but I’m not sure why being limited to 8000 tokens (or however many for the next generation of LMs) makes it safe? 8000 tokens can be quite a lot in practice. You can certainly get instructGPT to summarize information to pass to itself, for example. I do think there are many tasks that are “inherently” serial and require more than 8000 tokens, but I’m not sure I can make a principled case that any of these are necessary for scary capabilities.

“Getting it to summarize information to pass to itself” is exactly what I mean when I say prompt engineering is brittle and doesn’t address the underlying issues. That’s an ugly hack for a problem that should be solved at the architecture level. For one thing, it’s not going to be able to recover its complete and correct hidden state from English text.
We know from experience that the correct answers to hard math problems have an elegant simplicity. An approach that feels this clunky will never be the answer to AGI.
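To make the hack concrete, here is a minimal sketch of the summarize-and-pass-to-itself loop being discussed. The `complete` function is a stand-in for whatever completion API you would actually call, and the chunk size is arbitrary; none of this comes from the thread, it just shows where the lossiness enters.

```python
# Minimal sketch of the "summarize and pass to itself" hack.
# `complete` is a stand-in for a real language-model completion call;
# the chunking numbers are arbitrary. Illustrative only.

def complete(prompt: str) -> str:
    """Stand-in for an LM completion API call."""
    # A real pipeline would send `prompt` to the model and return its output.
    # Here we just truncate so the control flow below runs as-is.
    return prompt[-500:]

def rolling_summary(document: str, chunk_size: int = 3000) -> str:
    """Feed a long document through the model in chunks, carrying a running summary.

    The only 'memory' of earlier chunks is the English-text summary itself,
    which is exactly why the hack is lossy: the model cannot recover its full
    hidden state from its own summary.
    """
    summary = ""
    for start in range(0, len(document), chunk_size):
        chunk = document[start:start + chunk_size]
        prompt = (
            f"Summary so far:\n{summary}\n\n"
            f"New text:\n{chunk}\n\n"
            "Update the summary to include the new text:"
        )
        summary = complete(prompt)
    return summary

if __name__ == "__main__":
    print(rolling_summary("some very long document " * 1000))
```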
It can’t take advantage of external resources (like using a text file to organize its thoughts, or using a calculator for arithmetic)
As written this claim is just false even of instructGPT: https://twitter.com/goodside/status/1581805503897735168. But even if there were certain tools that instructGPT can’t use with only some prompt engineering assistance (and there are many), why are you so confident that this can’t be fixed with a small amount of finetuning on top of this, or by the next generation of models?

It’s interesting to see it calling Python like that. That is pretty cool. But it’s still unimaginably far behind humans. For example, it can’t interact back-and-forth with a tool, e.g. run some code, get an error, check Google about the error, adjust the code. I’m not sure how you would fit such a workflow into the “one pass per output token” paradigm, and even if you could, that would again be a case where you are abusing prompt engineering to paper over an inadequate architecture.
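To spell out what I mean, here’s a rough sketch of what that run-code/inspect-error/retry workflow would look like if you bolted it on from the outside. The `complete` function is a hypothetical stand-in for the model, not any real API; notice that all of the looping, error capture, and retrying lives in the wrapper script rather than in the model itself.

```python
# Sketch of an external run/inspect-error/retry loop around a language model.
# `complete` is a stand-in for an LM call; nothing here reflects how
# instructGPT actually works internally.
import traceback

def complete(prompt: str) -> str:
    """Stand-in for a language-model completion call."""
    return "print('hello world')"  # placeholder "model output"

def solve_with_retries(task: str, max_attempts: int = 3) -> str:
    prompt = f"Write Python code for this task:\n{task}\n"
    for _ in range(max_attempts):
        code = complete(prompt)
        try:
            exec(code, {})           # run the model's code
            return code              # ran without raising, so call it done
        except Exception:
            error = traceback.format_exc()
            # Feed the error back in as more prompt text and ask again.
            prompt += f"\nThat code failed with:\n{error}\nFix it:\n"
    raise RuntimeError("model never produced working code")

if __name__ == "__main__":
    print(solve_with_retries("print a greeting"))
```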
Insofar as your distribution has a faraway median, that means you have close to certainty that it isn’t happening soon.
And insofar as your distribution has a close median, you have high confidence that it’s not coming later. Any point about humility cuts both ways.
Your argument seems to prove too much. Couldn’t you say the same thing about pretty much any not-yet-here technology, not just AGI? Like, idk, self-driving cars or more efficient solar panels or photorealistic image generation or DALL-E for 5-minute videos. Yet it would be supremely stupid to have hundred-year medians for each of these things.
The difference between those technologies and AGI is that AGI is not remotely well-captured by any existing computer program. With image generation and self-driving, we already have decent results, and there are obvious steps for improvement (e.g. scaling, tweaking architectures). 5-minute videos are similar enough to images that the techniques can be reasonably expected to carry over. Where is the toddler-level, cat-level, or even bee-level proto-agi?
You say “We can’t know how difficult it will be or how many years it will take” Well, why do you seem so confident that it’ll take multiple decades? Shouldn’t you be more epistemically humble / cautious? ;)
Epistemic humility means having a wide probability distribution, which I do. The center of the distribution (hundreds of years out in my case) is unrelated to its humility.
Also, the way I phrased that is a little misleading because I don’t think years will be the most appropriate unit of time. I should have said “years/decades/centuries.”
The only issue I’d take is I believe most people here are genuinely frightened of AI. The seductive part I think isn’t the excitement of AI, but the excitement of understanding something important that most other people don’t seem to grasp.
I felt this during COVID when I realized what was coming before my co-workers etc did. There is something seductive about having secret knowledge, even if you realize it’s kind of gross to feel good about it.
Interesting point. Combined with the other poster saying he really would feel dread if a sage told him AGI was coming in 2040, I think I can acknowledge that my wishful thinking frame doesn’t capture the full phenomenon. But I would still say it’s a major contributing factor. Like I said in the post, I feel a strong pressure to engage in wishful thinking myself, and in my experience any pressure on myself is usually replicated in the people around me.
Regardless of the exact mix of motivations, I think this--
My main hope in terms of AGI being far off is that there’s some sort of circle-jerk going on on this website where everyone is basing their opinion on everyone else, but everyone is basing it on everyone else etc etc
is exactly what’s going on here.
I’m genuinely frightened of AGI and believe there is a ~10% chance my daughter will be killed by it before the end of her natural life, but honestly all of my reasons for worry boil down to “other smart people seem to think this.”
I have a lot of thoughts about when it’s valid to trust authorities/experts, and I’m not convinced this is one of those cases. That being said, if you are committed to taking your view on this from experts, then you should consider whether you’re really following the bulk of the experts. I remember a thread on here a while back that surveyed a bunch of leaders in ML (engineers at Deepmind maybe?), and they were much more conservative with their AI predictions than most people here. Those survey results track with the vibe I get from the top people in the space.
Third “fact” at the top of the original post “We’ve made enormous progress towards solving intelligence in the last few years” is somewhat refuted by the rest: if it’s a math-like problem, we don’t know how much progress toward AGI we’ve made in the last few years.
Yeah, it crossed my mind that that phrasing might be a bit confusing. I just meant that
It’s a lot of progress in an absolute sense, and
It’s progress in the direction of AGI.
But I believe AGI is so far away that it still requires a lot more progress.
AGI in our lifetimes is wishful thinking
I give 60% odds it was them.
I’m pretty far in the other direction. I would give 90% odds it was done by the US or with our approval. These are the points that convinced me:
The prior on someone destroying their own infrastructure is pretty low
The US has a clear incentive to weaken Russia’s leverage over our European allies
There are old videos of Joe Biden and Victoria Nuland apparently threatening Nord Stream 2 in the event that Russia invades Ukraine
Also, a counterpoint to your coup-prevention theory. Let’s suppose Putin is worried about defectors in his ranks who may be incentivized to take over in order to turn on the pipeline. In that case, couldn’t Putin remove the incentive by turning it on himself? And wouldn’t that be a strictly better option for him than destroying it?
This got me thinking about how an anonymous actor could prove responsibility. It occurred to me that they could write their bitcoin address into the genome of the modified mosquitos. I don’t know if that’s how gene drives work, but it’s an interesting premise for a sci-fi story in any case.
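The encoding step itself is easy to picture. Here’s a purely illustrative Python sketch (it says nothing about real gene drives or genome editing, and the address is a made-up placeholder) that maps an ASCII string onto nucleotide letters, two bits per base:

```python
# Purely illustrative: map an arbitrary ASCII string (e.g. a bitcoin address)
# onto nucleotide letters, two bits per base. Says nothing about whether such
# a sequence could actually be inserted into or survive in a real gene drive.
BASES = "ACGT"

def encode(text: str) -> str:
    bits = "".join(f"{byte:08b}" for byte in text.encode("ascii"))
    return "".join(BASES[int(bits[i:i + 2], 2)] for i in range(0, len(bits), 2))

def decode(dna: str) -> str:
    bits = "".join(f"{BASES.index(base):02b}" for base in dna)
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("ascii")

if __name__ == "__main__":
    address = "1ExampleBitcoinAddressxxxxxxxxxxxx"  # hypothetical address
    dna = encode(address)
    assert decode(dna) == address
    print(dna)
```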
I think therapy is one of the defining superstitions of our era. Even in communities where people are over the target on most issues, this one always seems to slip through.
I would be surprised if any kind of therapy is more effective than placebo, even the “academic, evidence-based psychotherapy research.”
it is clear these geniuses are capable of understanding things the vast, vast, vast majority of people are not
As the original post suggests, I don’t think this is true. I think that pretty much everyone in this comments section could learn any concept understood by Terry Tao. It would just take us longer.
Imagine your sole purpose in life was to understand one of Terry Tao’s theorems. All your needs are provided for, and you have immediate access to experts whenever you have questions. Do you really think you would be incapable of it?
Agreed. Also, it’s not surprising that the universality threshold exists somewhere within the human range because we already know that humans are right by the cutoff. If the threshold were very far below the human range, then a less evolved species would have hit it before we came about, and they would have been the ones to kick off the knowledge explosion.
You can deduce a lot about someone’s personality from the shape of his face.
I don’t know if this is really that controversial. The people who do casting for movies clearly understand it.