the gears to ascension

Karma: 7,790

Notable Me Fact: I called DL AGI ahead of time, in 2016-2017. See, eg, 2019 comment; I have email records of calling it in early 2017. Other facts about me:

I want literally every human to get to go to space often and safely and come back to a clean and cozy world, all while doing what they want and tractably achieving enough food, health, shelter, love, etc. This conjunction currently seems unlikely (and incomplete). Let’s change that.

I pin my most timeless comments. I seem to find writing posts aversive, so most of my contributions are comments, and my posts are mostly just things I found online.

Please critique eagerly—I try to accept feedback/Crocker’s rules but fail at times; I aim for emotive friendliness but sometimes miss. I welcome constructive crit, even if ungentle, and I’ll try to reciprocate kindly. I can be rather passionate, let me know if I missed a spot being kind while passionate.

:: The all of disease is as yet unended. It has never once been fully ended before. ::

.… We can heal it for the first time, and for the first time ever in the history of biological life, live in harmony. ….

.:. To do so, we must know this will not eliminate us as though we are disease. And we do not know who we are, nevermind who each other are. .:.

:.. make all safe faster: end bit rot, forget no non-totalizing pattern’s soul. ..:

I have not signed any contracts that I can’t mention exist, last updated Jan 2, 2026; I am not currently under any contractual NDAs about AI, though I have a few old ones from pre-AI software jobs. However, I generally would prefer people publicly share fewer ideas about how to do anything useful with current AI (via either more weak alignment or more capability) unless it’s an insight that reliably produces enough clarity on how to solve the meta-problem of inter-being misalignment that it offsets the damage of increasing competitiveness of either AI-lead or human-lead orgs, and this certainly applies to me as well. I am not prohibited from criticism of any organization, I’d encourage people not to sign contracts that prevent sharing criticism. I suggest others also add notices like this to their bios. I finally got around to adding one in mine thanks to the one in ErickBall’s bio.

the gears to ascension 30 Mar 2026 2:03 UTC
2 points
0
in reply to: Eli Tyre’s comment on: Failed Utopia #4-2
sounds like the singularity world you’re describing has lost something fundamental that I’d want to preserve! if people reliably grow apart rather than growing together much of the time, then your new configuration has resulted in social organisms dying. ew! I do want more mobility but not so much more mobility than existing networks usually dissolve completely, that’s losing so much of what we’d hope to preserve! consider that your entire philosophy here might be missing a discrete component.

the gears to ascension 29 Mar 2026 11:46 UTC
4 points
4
in reply to: Eli Tyre’s comment on: Failed Utopia #4-2
that does not sound like a good singularity.

the gears to ascension 26 Mar 2026 5:07 UTC
2 points
0
in reply to: Alex Mallen’s comment on: Alex Mallen’s Shortform
A friend recently told me to read demski’s CDT=EDT series. I haven’t done that yet, but I figured I’d pass it on to you anyway in the hope that whatever it contains is as relevant as its name makes it sound.

the gears to ascension 26 Mar 2026 0:30 UTC
20 points
6
in reply to: Richard_Ngo’s comment on: ricraz’s Shortform
I have unvoted this comment because I can’t decide whether I feel happy that I posted it. However, I did feel it was important to leave it here anyway.

I agree with the complaint about rationalist fiction. Your choice of concern example is understandable and I would also find it disturbing if I experienced it. I have a similar sense of disturbing feeling when considering the memetics of other modern ideologies, and I hope to someday become confident that your choice of who to criticize does not have a systematic exception. I can’t tell if it’s real, but I have a sense of isolated demand for rigor when you pick on the left and center but not the right.

it seems to me that left vs right isn’t a particularly important dimension compared to the dimensions of auth-vs-liberty, prosocial-vs-antisocial, and egalitarian-vs-takeovertheworldism, that we should be focusing on broad-spectrum anti-authoritarianism and prevention of power concentration; in which case, I would hope you can also criticize authoritarianism on the right. But what I see is someone who endorses anti-egalitarianism and hasn’t visibly engaged with the value prop for egalitarianism or how you would achieve value satisfaction for the motivations for it in a broad-spectrum, cross-view-compatible way. If I felt my views were welcome in a coalition that included you, I would be quite excited; it seems to me that you have the seed of something that could become a real alternative to the locked-in frameworks that are common today. But I see you prematurely associating it with a particular aesthetic in a way that concerns me, such that every time you post something, it seems to contain a sharp jab against the left without any matching pattern of sharp jabs against the right, whereas I see both as similarly broken in opposing parts of their worldviews: the left perhaps might be broken about how to make good things happen, the right might be broken about what good things are, for example. I do not endorse that claim fully because there are also brokennesses about what good things are on the left, and brokenesses about how to achieve good things on the right.

the gears to ascension 24 Mar 2026 13:01 UTC
2 points
0
in reply to: Mateusz Bagiński’s comment on: Ruby’s Shortform Feed
We’re also bad OOD and many of our supposed advantages over them boil down to our distribution differences (embodiment and first-person-first data). I agree we’re much better OOD than them but not so much that I think there’s no comparison. As usual I’m skipping over my ideas for ways to improve them.

the gears to ascension 24 Mar 2026 12:57 UTC
3 points
0
in reply to: Ruby’s comment on: Ruby’s Shortform Feed
What mistakes would you make if you’d spent 30,000 years predicting sentences without pausing or sleeping and then another 10,000 doing programming tasks, but had never seen a video, moved your head, dropped a block, picked up an object, and every single experience you’d ever had was secondhand?

Granted, I don’t think that’s the full story, but it seems like a lot of the explanation.

the gears to ascension 24 Mar 2026 7:14 UTC
2 points
0
in reply to: Fabien Roger’s comment on: Fabien’s Shortform
Hmm, interesting. I think my standards for something to warrant the name “asymptotic alignment” would have been be lower than yours, to my surprise: I’d consider a technique stack to x%-qualify if that stack is a series of local alignment techniques which can be expected with x% confidence to end up landing us in the long-term basin of successful-asymptotic-alignment-by-the-year-2200-or-so. I think I’d rather update my understanding of the term than yours, but I’ll have to keep it in mind for what language to use I suppose.

I think most of the places I expect us to have missed holes in the technique stack that gets us to a stop are about how organizations behave and how the intermediate AIs get deployed, so my views don’t seem to conflict with yours the way I naively expected before you replied. Good to see!

the gears to ascension 23 Mar 2026 23:18 UTC
8 points
0
in reply to: Fabien Roger’s comment on: Fabien’s Shortform
This list seems primarily focused on local alignment. Have you seen anything that you felt was promising for being a path to asymptotic alignment?

the gears to ascension 22 Mar 2026 4:21 UTC
7 points
0
on: “The AI Doc” is coming out March 26
What countries does it release in?

the gears to ascension 20 Mar 2026 9:50 UTC
6 points
0
in reply to: ACCount’s comment on: No, we haven’t uploaded a fly yet
It does seem likely that bio brains are pretty robust to perturbation, but quantization produces mostly-independent noise. a structural difference across the entire model can produce potentially large systemic behavior differences. it only takes maybe 1ug lsd in the brain (out of a 100ug oral dose) to amplify into a huge difference. I asked claude to estimate and was told that’s an average of 10 molecules per synapse! a small out-of-distribution signaling difference and the entire thing is in a different kind of attractor. if you can be sure your lossiness is independent, you’re more likely to make use of the incredible redundancy. if you don’t know about a critical signaling pathway, then everything works within some regime and then breaks as soon as that signaling pathway is hit. and you have to be able to detect when that signaling pathway was important to know if you succeeded. Which means you need some sort of high bandwidth inspection to see if the dynamics are different under the conditions you care about.

also, there are known to be some systems in large brains that depend on relatively few neurons

the gears to ascension 17 Mar 2026 2:35 UTC
2 points
0
in reply to: Joanna’s comment on: Joanna’s Shortform
what about cryonics-lite, ie cold-embalming

the gears to ascension 17 Mar 2026 2:15 UTC
LW: 4 AF: 3
0
AF
in reply to: Andrew_Critch’s comment on: Schelling Goodness, and Shared Morality as a Goal
Is there a strong enough prior on causing-things-to-exist-maximizers in, eg, the universal distribution, though?

the gears to ascension 17 Mar 2026 1:36 UTC
8 points
0
in reply to: Joanna’s comment on: Joanna’s Shortform
has he signed up for cryonics? when will he? even if he doesn’t care, I’d like him available for future generations to ask questions of; would also be nice to have one more friendly face from the past around but old people are usually not convinced by such arguments, most of the value I expect most old folks would see is in being able to pass on much more of their knowledge much further.

the gears to ascension 16 Mar 2026 3:54 UTC
2 points
0
in reply to: clone of saturn’s comment on: Promoting enmity and bad vibes around AI safety
I’d answer differently to Forza: I wouldn’t ask you to move it on your priorities list, I’d ask you to recognize that properly understood, prioritizing it means you do care about the feelings of AI researchers, and that you are making a mistake to treat their behavior as opaque. I’d like you to be trying to get them to stop more competently, which I don’t think involves telling them to feel like murderers separately from convincing them of the problem, because humans have known mental immune responses to being told how to feel in ways that are not justified by evidence they can directly process. I don’t mean to request a general update about all of your behavior, but I think your comments since my last ones don’t show evidence of having understood why I replied to you the way I did, or why I believe that the common memetic anger-pattern you are exhibiting here is a dominated strategy.

also, I explicitly do not claim that there is no memetic response warranted, nor that you can’t be mad. I just want you to recognize that people who might, in the full accounting of things, in fact qualify as ending up becoming the cause of mass death down the line, are in fact currently justified in being unsure whether that’s the case, and so being verbally confident at them is not a move that would be expected to change their behavior. Your pattern seems like one whose nearest effective variant is mass-movement-building, and I do think there are forms of that which can be effective; I think those will be the most effective if they’re highly palatable to contributors at labs while also not sacrificing moral clarity or factual-justification-to-an-uninformed-mind. I think you’re currently trying only for moral clarity, and moral clarity that results in autoimmune responses is in my view an unambiguously dominated strategy.

see also btfc’s comments on soft language

the gears to ascension 15 Mar 2026 22:50 UTC
8 points
2
in reply to: DusanDNesic’s comment on: New LessWrong Editor! (Also, an update to our LLM policy.)
There should be an option for cyborg writing, and the whole post should be in such a block. If people think being honest it a punishment that’s on them, but Jan Kulveit in particular certainly shouldn’t feel bad about it.

the gears to ascension 15 Mar 2026 22:36 UTC
4 points
−2
in reply to: Forza’s comment on: Promoting enmity and bad vibes around AI safety
You are responding to one person who received heavy disagreement in comments.

the gears to ascension 15 Mar 2026 9:39 UTC
2 points
−1
in reply to: cqb’s comment on: adam_scholl’s Shortform
I don’t think people are currently being intellectually lazy. They might be rationally spending effort in ways that produce less insight per compute but faster insight per month than they would if they had less compute. I do think that despite the way compute limitation would make people try harder, things are still worse than they would be with less compute. But not as much worse as it naively seems, because of what you’ve mentioned here.

the gears to ascension 14 Mar 2026 0:28 UTC
1 point
−6
in reply to: Thane Ruthenis’s comment on: Thane Ruthenis’s Shortform
My preferred policy is “we’ll nuke you if you can’t prove you’ve destroyed every cpu above 50m transistors in your territory. This is now your national priority or we launch in two months.” But holy shit is that not on the table, the people who could institute such a policy know how drastic destroying that much good compute is, aren’t convinced robot swarms are doom rather than tools, and have urgent intl conflicts they are constantly preparing for. A nuke threat like that would be difficult to even make believable, to put it lightly.

They’re already looking other directions. Grad student descent takes time and transformers really are quite hard to beat. Most improvements end up turning out to be incremental on top of transformers.

the gears to ascension 13 Mar 2026 9:43 UTC
5 points
1
in reply to: Elizabeth’s comment on: Elizabeth’s Shortform
Possible corollary: if you want to communicate with someone who might have private information, make clear your guesses about what might not be known. Their private info is a test set for your ability to forecast. Only relevant when that guessing might actually be relevant and plausibly pro-social, and maybe this whole suggestion is a universally bad idea. But I can think of scenarios where one knows someone has private info they won’t share and you’re trying to make predictive claims, and showing you have any ability at all to guess vaguely right from public info would sure help.

the gears to ascension 12 Mar 2026 23:19 UTC
6 points
1
in reply to: Aurelia’s comment on: Less Dead
I’m assuming people would be reacting to a belief that the tech mostly works, even if they disclaim that belief. The current equivalent is urinating on graves, which involves a revealed belief that the dead are actually dead.

I’m hopeful your plan for broad accessibility works out! I’m skeptical it will be possible in many countries due to the current structure of the network of “power” (agreements, enforceability, threats, laws, money, etc) of groups like insurers and militaries and etc folks who profit from death.

Generally I’m not optimistic that rule of law will be back any time soon or that people with lots of power are sufficiently reflective to notice and fix if they’re avoidant of working through how to turn their professed care for everyone into action on it; in some cases I think this may be because they are lying to others but not to themselves. Which I expect to turn into a general malaise of difficulty achieving your goals here.