Eliezer’s Unteachable Methods of Sanity

Eliezer Yudkowsky7 Dec 2025 2:46 UTC

492 points

“How are you coping with the end of the world?” journalists sometimes ask me, and the true answer is something they have no hope of understanding and I have no hope of explaining in 30 seconds, so I usually answer something like, “By having a great distaste for drama, and remembering that it’s not about me.” The journalists don’t understand that either, but at least I haven’t wasted much time along the way.

Actual LessWrong readers also sometimes ask me how I deal emotionally with the end of the world.

I suspect a more precise answer may not help. But Raymond Arnold thinks I should say it, so I will say it.

I say again, I don’t actually think my answer is going to help. Wisely did Ozy write, “Other People Might Just Not Have Your Problems.” Also I don’t have a bunch of other people’s problems, and other people can’t make internal function calls that I’ve practiced to the point of hardly noticing them. I don’t expect that my methods of sanity will be reproducible by nearly anyone. I feel pessimistic that they will help to hear about. Raymond Arnold asked me to speak them anyways, so I will.

Stay genre-savvy / be an intelligent character.

The first and oldest reason I stay sane is that I am an author, and above tropes. Going mad in the face of the oncoming end of the world is a trope.

I consciously see those culturally transmitted patterns that inhabit thought processes aka tropes, both in fiction, and in the narratives that people try to construct around their lives and force their lives into.

The trope of somebody going insane as the world ends, does not appeal to me as an author, including in my role as the author of my own life. It seems obvious, cliche, predictable, and contrary to the ideals of writing intelligent characters. Nothing about it seems fresh or interesting. It doesn’t tempt me to write, and it doesn’t tempt me to be.

It would not be in the interests of an intelligent protagonist to amplify their own distress about an apocalypse into more literarily dramatic ill-chosen behavior. It might serve the interests of a hack author but it would not help the character. Understanding that distinction is the first step toward writing more intelligent characters in fiction. I use a similar and older mental skill to decide which tropes to write into the character that is myself.

This sense—which I might call, genre-savviness about the genre of real life—is historically where I began; it is where I began, somewhere around age nine, to choose not to become the boringly obvious dramatic version of Eliezer Yudkowsky that a cliche author would instantly pattern-complete about a literary character facing my experiences. Specifically, though I expect this specific to mean nothing to a supermajority of you, I decided that as a relatively smart kid I would not become Raistlin Majere, nor ever exhibit a large collection of related tropes.

The same Way applies, decades later, to my not implementing the dramatic character a journalist dreams up—a very boring and predictable pattern-completion of a character—when they dream up a convenient easy-to-write-about Eliezer Yudkowsky who is a loudly tortured soul about his perception of the world’s end approaching along its default course.

“How are you coping?” journalists sometimes ask me, and sometimes nowadays they have become worried themselves and want to know for themselves if there’s a key to coping. But often today, and before ChatGPT almost always, they are planning a Character-Focused Story about how my Tortured Soul deals with an imaginary apocalypse, to exhibit to their readers like a parent takes their kids to the zoo to stare at a strange animal. I reply to them “I have a great distaste for drama”, but the actual answer is “I am a better writer than you, and I decided not to write myself as that incredibly cliche person that would be easy and convenient for you to write about.”

“Going insane because the world is ending” would be a boring trope and beneath my dignity to choose as my actual self’s character.

Don’t make the end of the world be about you.

“How are you coping with the end of the world?” journalists sometimes ask me, and I sometimes reply, “By remembering that it’s not about me.” They have no hope of understanding what I mean by this, I predict, because to them I am the subject of the story and it has not occurred to them that there’s a whole planet out there too to be the story-subject. I think there’s probably a sense in which the Earth itself is not a real thing to most modern journalists.

The journalist is imagining a story that is about me, and about whether or not I am going insane, not just because it is an easy cliche to write, but because personality is the only real thing to the journalist.

This is also a pattern that you can refuse, when you write the story that is yourself; it doesn’t have to be a story that is ultimately about you. It can be about humanity, humane preferences, and galaxies. A sentence about snow is words, is made of words, but it is about snow. You are made of you, but you don’t need to be all about yourself.

If I were to dwell on how it impacted me emotionally that the world was ending, I would be thinking about something which genuinely doesn’t matter to me very much compared to how the world is ending. Having dramatic feelings is not mostly what I am about—which is partly how I ended up being not much made of them, either; but either way, they’re not what I’m about.

So long ago that you probably can’t imagine what it was like back then, not just before ChatGPT but years before the age of deep learning at all, there was a person who thought they were like totally going to develop Artificial General Intelligence. Then they ran into me; and soon after, instead started agonizing about how they had almost destroyed the world. Had they actually been that close to success? Of course not. But I don’t relate to status as most people do, so that part, the status-overreach, wasn’t the part I was rolling my eyes about. It is not the sort of epistemic prediction error that I see as damnable in the way that a status-regulator sees it as the worst thing in the world; to underestimate oneself is no more virtuous than to overestimate oneself. Rather, I was rolling my eyes about the part that was a more blatant mistake, completely apart from the epistemic prediction error they probably couldn’t help; the part that would have been a mistake even if they had almost destroyed the world. I was rolling my eyes about how they’d now found a new way of being the story’s subject.

Even if they had almost destroyed the world, the story would still not properly be about their guilt or their regret, it would be about almost destroying the world. This is why, in a much more real and also famous case, President Truman was validly angered and told “that son of a bitch”, Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman. Oppenheimer was trying to have nuclear weapons be about Oppenheimer’s remorse at having helped create nuclear weapons. This feels obviously icky to me; I would not be surprised if Truman felt very nearly the same.

And so similarly I did not make a great show of regret about having spent my teenage years trying to accelerate the development of self-improving AI. Was it a mistake? Sure. Should I promote it to the center of my narrative in order to make the whole thing be about my dramatic regretful feelings? Nah. I had AGI concerns to work on instead.

I did not neglect to conduct a review of what I did wrong and update my policies; you know some of those updates as the Sequences. But that is different from re-identifying myself as a dramatic repentent sinner who had thereby been the story’s subject matter.

In a broadly similar way: If at some point you decide that the narrative governing your ongoing experience will be about you going insane because the world is ending: Wow, congratulations at making the end of the world still be about you somehow.

Just decide to be sane, and write your internal scripts that way.

The third way I stay sane is a fiat decision to stay sane.

My mental landscape contains that option; I take it.

This is the point I am even less expecting to be helpful, or to correspond to any actionable sort of plan for most readers.

I will nonetheless go into more detail that will probably not make any sense.

Besides being a thing I can just decide, my decision to stay sane is also something that I implement by not writing an expectation of future insanity into my internal script / pseudo-predictive sort-of-world-model that instead connects to motor output.

(Frankly I expect almost nobody to correctly identify those words of mine as internally visible mental phenomena after reading them; and I’m worried about what happens if somebody insists on interpreting it anyway. Seriously, if you don’t see phenomena inside you that obviously looks like what I’m describing, it means, you aren’t looking at the stuff I’m talking about. Do not insist on interpreting the words anyway. If you don’t see an elephant, don’t look under every corner of the room until you find something that could maybe be an elephant.)

One of the ways you can get up in the morning, if you are me, is by looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen. (To be clear, I actually do this very rarely; it is just a fun fact that this is a way I can defeat bed inertia.)

There are a lot of neighboring bad ideas to confuse this with. The trick I’m describing above does not feel like desperately hyping myself up and trying to believe I will get out of bed immediately, with a probability higher than past experience would suggest. It doesn’t involve lying to myself about whether I’m likely to get up. It doesn’t involve violating the epistemic-instrumental firewall (factual questions absolutely separated from the consequences of believing things), to give myself a useful self-fulfilling prophecy. It is not any of the absurd epistemic-self-harming bullshit that people are now flogging under brand names like “hyperstition”, since older names like “chaos magick” or “lying to yourself” became less saleable. I still expect to them to point to this and say, “Why, of course that is the same thing I am selling to you as ‘hyperstition’!” because they would prefer not to look at my finger, never mind being able to see where I’m pointing.

With that said: The getting-out-of-bed trick involves looking into the part of my cognition where my action plan is stored, and loading an image into it; and because the human brain’s type system is a mess, this has the native type-feeling of an expectation or prediction that in a few seconds I will execute the motor-plan and get out of bed.

That I am working with cognitive stuff with that type-feel, is not the same thing as lying to myself about what’s likely to happen; no, not even as a self-fulfilling prophecy. I choose to regard the piece of myself whose things-that-feel-like-predictions get sent as default motor output, as having the character within my Way of a plan I am altering; rather than, you know, an actual mistaken prediction that I am believing. If that piece of myself gets to have me roll out of bed, I get to treat it as a plan rather than as a prediction. It feels internally like a prediction? Don’t believe everything you feel. It’s a pseudo-model that outputs a pseudo-prediction that does update in part from past experience, but its actual cognitive role is as a controller.

The key step is not meditating on some galaxy-brained bullshit about Lob’s Theorem, until you’ve convinced yourself that things you believe become true. It’s about being able to look at the internal place where your mind stores a pseudo-predictive image of staying in bed, and writing instead a pseudo-prediction about getting out of bed, and then letting that flow to motor output three seconds later.

It is perhaps an unfortunate or misleading fact about the world (but a fact, so I deal with it), that people telling themselves galaxy-brained bullshit about Lob’s Theorem or “hyperstition” may end up expecting that to work for them; which overwrites the pseudo-predictive controlling output, and so it actually does work for them. That is allowed to be a thing that is true, for reality is reality. But you don’t have to do it the scrub’s way.

Perceiving my internal processes on that level, I choose:

I will not write internal scripts which say that I am supposed to / pseudo-predict that I will, do any particular stupid or dramatic thing in response to the end of the world approaching visibly nearer in any particular way.

I don’t permit it as a narrative, I don’t permit it as a self-indulgence, and I don’t load it into my pseudo-predictive self-model as a pending image that gets sent by default to internal cognitive motor outputs.

If you go around repeating to yourself that it would be only natural to respond to some stressful situation by going insane—if you think that some unhelpful internal response is the normal, the default, the supposed-to reaction to some unhelpful external stimulus—that belief is liable to wire itself in as being also the pseudo-prediction of the pseudo-model that loads your default thoughts.

One could incorrectly summarize all this as “I have decided not to expect to go insane,” but that would violate the epistemic-instrumental firewall and therefore be insane.

(All of this is not to be confused with the confused doctrine of active inference. That a brain subsystem sometimes repurposes a previously evolved piece of predictive machinery as a generalizing cache system that then sends its outputs as control signals, does not reveal some deep law about prediction and planning being the same thing. They’re not. Deep Blue made no use of that idiom, purely separated prediction from planning, and worked just fine. The human brain is just a wacky biological tangle, the same way that human metabolism repurposes the insanely reactive chemical byproduct of superoxide as a key signaling molecule. It doesn’t have to be that way for deep theoretical reasons; it’s just biology being a tangle.)

(All of this is not to be confused with the Buddhist doctrine that every form of negative internal experience is your own fault for not being Buddhist enough. If you rest your hand on a hot stove, you will feel pain not because your self-pseudo-model pseudo-predicts this to be painful, but because there’s direct nerves that go straight to brain areas and trigger pain. The internal mechanism for this does not depend on a controlling pseudo-prediction, it just falls downward like a stone under gravity. The same directness is allowed to be true about suffering and not just pain; if there’s a clever way to overwrite pseudo-predictions of suffering and thereby achieve Buddhist indifference to bad things, I don’t have it as a simple obvious surface lever to pull. I also haven’t chosen to go looking for a more complicated or indirect version of it. I do not particularly trust that to end well.

But I do think there are various forms of drama, error, and insanity which are much more like “things people do because they expected themselves to do it”; and much less like the pain, or suffering, from burning your hand.)

There’s an edition of Dungeons and Dragons that has a god of self-improvement, called Irori. My fanfictions sometimes include characters that worship Him (heresy), or seek what He sought (approved).

In my fictional reification, Irori’s religion has mottos like, “You don’t have problems, you have skill issues.” Irorians can be a bit harsh.

But even if something is a skill issue, that doesn’t mean you have the skill, nor know how to solve it.

When an Irorian calls something a skill issue, they’re not instructing you to feel bad about having not solved it already.

They are trying to convey the hope that it is solvable.

Doing crazy things because your brain started underproducing a neurotransmitter is a problem. It wouldn’t be very Irorian to tell you that you can’t solve it just through even clearer thinking; but if there’s a medication that directly fixes the problem, that is probably easier and faster and more effective. Also, this isn’t Dungeons and Dragons, Irori isn’t real, and possibly you genuinely can’t solve a neurotransmitter problem by thinking at it.

Doing crazy things because the world is ending is a skill issue.

These then are Eliezer Yudkowsky’s probably-irreproducible ways of staying sane as the world seems more visibly close to ending:

A distaste for the boringly obvious trope of a character being driven mad by impending doom;

Not making the story be all about me, including my dramatically struggling to retain my sanity;

And a fiat decision to stay sane, implemented by not instructing myself that any particular stupidity or failure will be my reaction to future stress.

Probably you cannot just go do those three things.

Then figure out your own ways of staying sane, whether they be reproducible or irreproducible; and follow those ways instead.

The reason that I tell you of my own three methods, is not to provide an actionable recipe for staying sane as the world begins to seem visibly closer to ending.

It is an example, a reminder, and maybe even an instruction to a part of yourself that produces self-pseudo-predictions that get loaded as your internal mental behavior:

Sanity is a skill issue.

What links here?

Eliezer Yudkowsky7 Dec 2025 2:46 UTC

492 points

152 comments10 min readLW link

Practical Rationality Death

Raemon 7 Dec 2025 9:55 UTC
92 points
27
Thanks!
The reason I asked you to write some-version-of-this is, I have in fact noticed myself veering towards a certain kind of melodrama about the whole x-risk thing, and I’ve found various flavors of your “have you considered just… not doing that?” to be helpful to me. “Oh, I can just choose to not be melodramatic about things.”
(on net I am still fairly relatively dramatic/narrative-shaped as rationalists go, but, I’ve deliberately tuned the knob in the other direction periodically and think various little bits of writing of yours has helped me)
I liked the framing you did at Solstice of it as a general prompt to treat it as a skill issue without being about the exact recipe.
- Nathan Young 13 Dec 2025 11:02 UTC
  5 points
  2
  Parent
  Seems like lots of people found this valuable, so bayes points I guess.
Kaj_Sotala 7 Dec 2025 8:57 UTC
67 points
33
I read this as being premised on “going crazy about the world ending” meaning that you end up acting obviously stupid and crazy, with the response basically being “find a way to not do that”.
My model about going crazy at the end of the world isn’t so much doing something that’s obviously crazy in your own view, but that the world ending is so out-of-distribution for everything you’ve been doing so far that you have no idea of what even is a sane or rational response anymore. For instance, if your basic sense of meaning has been anchored to a sense of the world persisting after you and you making some kind of mark on the world, you won’t know what to do with your life if there won’t be anything to make a mark on.
So staying sane requires also knowing what to do, not just knowing what not to do. Is there anything you would say about that?
- Eliezer Yudkowsky 7 Dec 2025 10:38 UTC
  68 points
  32
  Parent
  Base plan: Stay still, die quietly.
  
  There, you now have a better plan than going crazy! If you think up an even better plan you can substitute that one. Meliorization!
  - jimmy 7 Dec 2025 19:21 UTC
    25 points
    10
    Parent
    The point is that “maintaining sanity” is a (much) higher bar than “Don’t flail around like a drama queen”. Maintaining sanity requires you to actually update on the situation you find yourself in, and continue to behave in ways that make sense given the reality as it looks after having updated on all the information available. Not matching obvious tropes of people losing their mind is a start, but it is no safe defense. Especially since not all repeated/noticeable failure modes are active and dramatic, and not all show up in fiction.
    For example, if there’s something to David Gross’s comment that the wretched journalist was actually giving you an opening because they saw importance in what you had to say about the situation, blowing off a genuine opening to influence the discourse on AI safety while calling it “doing nothing” would not be sane. Preemptive contempt has a purpose in bounded rationality, but it’s still a form of pushing away from the information the journalist has to offer. It can make sense within a grand plan that weights this journalist low, but that requires a grand plan.
    How do you actually orient to the world, now that we are what we are? Are you still working to bring about the good outcome? If so, what’s the grand plan that ties everything together? Sharing that seems important for helping people retain sanity. Have you given up? If so, what is the overarching plan that drives how you choose to interact with the world? Because you still have to decide what to do with your time.
    This is a hell of a problem to orient to, and I don’t know that any of us get to say we’re doing it sanely. It’s a high bar to strive towards.
    The trope that this post and comment match to me isn’t one that shows up in science fiction. It’s a real bitch to wrestle free from, because the whole premise has to do with protecting stability of sense making by pushing away from challenging updates with avoidance and contempt, and the whole project fails if it doesn’t turn meta and resist awareness of the trope. I notice that even writing and rewriting this comment to be minimally threatening of stability without holding back content, it’s going to be a tough one to engage with to the extent that there isn’t a preexisting superstructure regulating contact with reality to maintain stability while minimizing the cost of missed updates.
    Which is certainly a possibility. As is leveraging the skill of becoming genre savvy as new patterns emerge (“trope dodging”).
    So if this contempt provokes contempt quickly, I’m sorry. My best isn’t always good enough, which is kinda the possibility we’re all wrestling with here.
  - TristanTrim 8 Dec 2025 19:34 UTC
    12 points
    4
    Parent
    I agree with this in a “catgirl volcano utopia” kinda way, but I think Kaj_Sotala was pointing more to a “words as pointers to locations in thingspace” issue. The word “sane” points to taking actions that work in the context you’re facing. It isn’t sane to shout about the sky falling when the sky isn’t falling and it’s easy for sane people to notice that the sky isn’t falling and that shouting about it is insane. But there isn’t an obvious plan for what you should do when the sky really is falling, so if the sky starts falling in ways that are obvious and difficult for normal people to ignore, then the thingspace cluster that “sane” used to point to starts to come apart.
    
    I like expanding “sane” to something like “know what’s true and do what works”… it’s an impossible standard but something to aspire to.
    
    It seems “sane” may also point to “not indulging in dramatic emotional expressions”, like not screaming, not crying, not punching inanimate objects. But pathos works. Emotions make characters in stories relatable. So the goal isn’t to stay sane, for that is not a well defined thing to do. The goal isn’t even to look sane, for looking insane may be compelling, and looking sane to everyone all the time is probably impossible. For people in general… “don’t think about what’s sane, think about what works” is probably good advice to gesture towards the actual goal.
- Markvy 8 Dec 2025 2:20 UTC
  2 points
  7
  Parent
  Re: “For instance, if your basic sense of meaning has been anchored to a sense of the world persisting after you and you making some kind of mark on the world, you won’t know what to do with your life if there won’t be anything to make a mark on.”
  
  Presumably the thing to do then is to devote x% of your effort to saving the world.
- David J Higgs 10 Dec 2025 20:01 UTC
  1 point
  0
  Parent
  In addition to the option of spending effort on reducing the chance the world ends, one could also reframe from “leaving a mark on the world that outlives you” to “contributing to something bigger and beyond yourself.” The world is bigger than you, more important than you and exists outside of you right now, as well as up until the world ends (if/when it does).
  
  Helping the world right now, and helping the world after you are gone, are morally equivalent, and quite possibly equivalent at the level of fundamental physics. I’m not sure what, other than a false sense of personal immortality (legacy as something beyond the actual beneficial effects on the world), is tied to benefiting the world later than your own time of existence. But perhaps that’s my own ignorance.
FlorianH 7 Dec 2025 21:01 UTC
47 points
38
[There’s also a much more banal answer that I wouldn’t be surprised if it is a major, deep underlying driver, with all the interesting psychology provided in OP being some sort of half-conscious rationalization for our actual deep-rooted tendencies:] Not going insane simply is the very natural default outcome for humans even in such felt dire situation:
While shallowly it might feel like it would, going insane actually appears to me to NOT AT ALL be the default human reaction to an anticipation of (even a quite high probability of) the world ending (even very soon). I haven’t done any stats or research, but everything I’ve ever seen or heard of seems to suggest to me:
- While they’re not anywhere nearly the majority, still very many people have very high P(doom soon) yet stay nearly perfectly calm (at best you might call them insanely calm, given the [true or imagined] circumstances).
  - I think this applies to many people e.g. on this forum, but I’m reminded of much more ‘normal’ persons uttering even more dramatic ‘I’m sure AI might already TMORROW kill us all’ - all while simply going on with their usual lives.
- Slightly less 1:1 but imho still underlining our sanity’s resilience in closely as dire situations: Many people seem egoistic enough such that the ending of their own life to mean a very large part of the world they care about to be going to end, and yet they face many situations of more or less imminent death rather calmly as opposed to going insane
  - Extend to various cases where family and/or friends and/or tribe is facing extinction; at least I haven’t heard of them to usually be going insane by the prospect of not-yet-actually-visible but forthcoming extinction.
    Once a torturous way of you or your close ones being killed has actually started, that’s of course different, that’s when you go insane.
- Caleb Biddulph 7 Dec 2025 23:04 UTC
  22 points
  15
  Parent
  Makes sense. Surely there were many cases in which our ancestors’ “family and/or friends and/or tribe were facing extinction,” and going insane in those situations would’ve been really maladaptive! If anything, the people worried about AI x-risk have a more historically-normal amount of worry-about-death than most other people today.
  - Eliezer Yudkowsky 7 Dec 2025 23:43 UTC
    23 points
    16
    Parent
    They didn’t need to deal with social media informing them that they need to be traumatized now, and form a conditional prediction of extreme and self-destructive behavior later.
    - Nemoto 19 Dec 2025 15:49 UTC
      5 points
      0
      Parent
      There actually are rising cases of “mental ill-health” right now. Here in the UK, services are swamped. I’m sure some of this is due to an attitude change, in that people now refer to anything from a minor upset, or a slight difference in neurological function, to normal emotions such as grief, shame and regret, as a mental illness.
      
      Previously the attitude in the post-war generation was more like Truman’s toward Oppenheimer: “Blood on his hands; damn it, he hasn’t half as much blood on his hands as I have. You just don’t go around bellyaching about it”. (Robert Oppenheimer: A Life Inside the Center; by Ray Monk.)
      
      Bellyaching, cutting people out of one’s life due to some human imperfection, and turning to substance abuse, all seem to be excused under this mental health label, as if the difficulties the world is facing right now are already sufficient to trigger the insanity expectation.
      
      I’m grateful to you for this article, which I intend to take very seriously, because it provides the tools we need to arm ourselves against succumbing to stresses of every kind, as well as coping better with everyday life today and in the future, however long that might be.
- Nick_Tarleton 8 Dec 2025 1:03 UTC
  20 points
  9
  Parent
  A cynical theory of why someone might believe going insane is the default human reaction: weaponized incompetence, absolving them of responsibility for thinking clearly about the world, because they can’t handle the truth, and they can’t reasonably be expected to because no normal human can either.
- jmh 9 Dec 2025 14:02 UTC
  2 points
  0
  Parent
  I wonder if situations like the Cuban missile crisis are good examples for your position. But then I also wonder if that (I think apparently worried but calm about the world ending in a nuclear conflict) isn’t contrasted by the claims about the mass hysteria after the radio broadcast of Well’s War of the Worlds.
  - Martin Randall 11 Dec 2025 2:37 UTC
    3 points
    0
    Parent
    Apparently (edit: that particular case of) mass hysteria is a myth. But however many people got confused, I don’t think this is a contradiction. If I updated P(aliens are invading) from 0% to 1%, it would change my plans for the evening, because I am sane.
    - Nick_Tarleton 11 Dec 2025 4:57 UTC
      4 points
      2
      Parent
      that particular case of mass hysteria is a myth
David Gross 7 Dec 2025 14:22 UTC
46 points
19
“How are you coping with the end of the world?” journalists sometimes ask me… The journalist is imagining a story that is about me, and about whether or not I am going insane...
Seems too cynical. I can imagine myself as a journalist asking you that question not because I’m hoping to write a throw-away cliche of an article, but because if I take seriously what you’re saying about AGI risk, you’re on the cutting edge of coping with that, and the rest of us will have to cope with that eventually, and we might have an easier time of it if we can learn from your path.
- Eliezer Yudkowsky 7 Dec 2025 17:21 UTC
  22 points
  13
  Parent
  I would of course take the question very differently from a journalist who had otherwise dealt with that slight inconvenience of trying to get to grips with an idea, and started to seem worried; instead of having had the brilliant idea of writing a Relatable Character-Focused Story instead.
  
  Perhaps I overestimate how much I can deduce from tone and context, but to me it seems like there’s a visible departure from the norm for the person who becomes worried themselves and wonders “How will people handle it?” versus the kid visiting the zoo to look at the strange creatures who believe strange things.
- ChristianKl 12 Dec 2025 22:19 UTC
  6 points
  5
  Parent
  I think you underrate how much of the job of a journalist is about simplifying complex events into a narrative that’s easy to read and consume for the audience of the newspaper.
  - Eliezer Yudkowsky 21 Dec 2025 6:23 UTC
    10 points
    9
    Parent
    Why, that’s my job too! But it’s a very different job depending on whether you consider it an indispensable requirement to have people coming away with a roughly accurate picture of reality, or if your job is to be an entertainer.
Zach Stein-Perlman 7 Dec 2025 22:46 UTC
35 points
0
Context: Bay Area Secular Solstice 2025
romeostevensit 7 Dec 2025 17:43 UTC
35 points
4
All of this is not to be confused with the Buddhist doctrine that every form of negative internal experience is your own fault for not being Buddhist enough.
Not really, but it’s a long explanation and at this point I’m pretty sure some of the inference steps have to be confirmed by laborious trained processes. Nor is this process about reality (as many delusional Buddhists seem to insist), but more like choosing to run a different OS on ones hardware. The size of the task and the low probability of success makes it not worth the squeeze for many afaict. For the record, in case it is helpful to anyone at all, there are three types of dukkha, and painful sensations are explicitly the ones one can do nothing about (other than mundane skillful action). It is the dukkha of change (stuck priors) and the dukkha of fabrications (much more complicated) that Buddhist training eliminates.
But the thing I actually want to comment about is related to a point I’ve had a really hard time communicating to people about the deciding to be sane thing. It’s a kind of scale-free mental move where people seem to have a really hard time with self-reference, thinking it’s some sort of gotcha when it isn’t. Not quite on the level of ‘if you kill a murderer the number of murderers remains the same’ but close. Like ‘don’t negotiate with internal processes that are acting like terrorists’ must, in the limit, turn you into an internal terrorist. It seems motivated by a strong aversive distaste for any top down mental moves, because their training data for that kind of move was always used adversarially. For example, in school, to disrupt and gaslight their own sense making, learning function, and value seeking, rather than helping them cultivate their own. Thus people seem to have a deep prior to regard all such with suspicion and not engage with the idea that a non-horrible version of this move is available.
I’ve spent a lot of time with the self-therapy modality of Core Transformation for this reason as it seems to cut directly at it, and the short version is something I think that most people can see the value of, Humans Are Not Automatically Strategic style:
1. What is the situation I am confronting?
2. What are my beliefs about myself and the situation?
3. What are my attitudes and feelings about the situation?
4. What do I want to do (not necessarily what I can, or should do)?
5. For what purpose do I want that?
6. What would having that mean for me?
7. Recurse (5,6) until terminal goal is uncovered (if objections come up, rebase the stack on the objection)
8. Who wants that?
Credit to Opening the Heart of Compassion by Martin Lowenthal and Lar Short for this version. To me, this is a generator that eventually can help cut at the root of ‘unable to do recursive sanity checks’ as the moves are more deeply internalized and the internal processes come to trust the resultant structure more.
- TsviBT 8 Dec 2025 19:18 UTC
  7 points
  0
  Parent
  (I think I may have asked you a similar question before, sorry if I forgot your answer:) Are there a couple compelling examples of someone who
  1. did something you’d identify as roughly this procedure;
  2. then did something I’d consider impressive (like a science or tech or philosophy or political advance);
  3. and attributed 2 to 1?
  - romeostevensit 8 Dec 2025 22:21 UTC
    10 points
    0
    Parent
    Not directly attributable, no. I think of most of these things as bringing up the floor rather than raising the ceiling.
    - TsviBT 8 Dec 2025 22:28 UTC
      9 points
      3
      Parent
      
      I think of most of these things as bringing up the floor rather than raising the ceiling.
      
      Ohhhh ok. That’s helpful, thanks.
- Jonas Hallgren 8 Dec 2025 11:53 UTC
  3 points
  0
  Parent
  Nor is this process about reality (as many delusional Buddhists seem to insist), but more like choosing to run a different OS on ones hardware.
  (I kind of wanted to give some nuance on the reality part from the OS Swapping perspective. You’re of course right with some overzealous people believing they’ve found god and similar but I think there’s more nuance here)
  If we instead take your perspective of OS swap I would say it is a bit like switching from Windows to Linux because you get less bloatware. To be more precise one of the main parts of the swap is the lessening of the entrenchments of your existing priors. It’s gonna take you a while to set up a good distro but you will be less deluded as a consequence and also closer to “reality” if reality is the ability to see what happens with the underlying bits in the system. As a consequence you can choose from more models and you start interpreting things more in real time and thus you’re closer to reality, what is happening now rather than the story of your last 5 years.
  Finally on the pain of the swap, there are also more gradual forms of this, you can try out Ubuntu (mindfulness, loving kindness) before switching over. Seeing through your existing stories can happen in degrees, you don’t have to become enlightened to enjoy the benefits?
  - romeostevensit 8 Dec 2025 15:35 UTC
    10 points
    0
    Parent
    This is an appealing story, but I haven’t really observed anyone get noticeably better at epistemology as a result of their practice. I remain confused about this for similar reasons to this story.
    - Kaj_Sotala 10 Dec 2025 11:27 UTC
      2 points
      0
      Parent
      I think part of the issue is that epistemology is largely a question of mindware, and practice does not fix missing or bad mindware any more than it can teach a person calculus if they’ve never studied it.
dr_s 7 Dec 2025 11:54 UTC
34 points
5
This is why, in a much more real and also famous case, President Truman was validly angered and told “that son of a bitch”, Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman. Oppenheimer was trying to have nuclear weapons be about Oppenheimer’s remorse at having helped create nuclear weapons. This feels obviously icky to me; I would not be surprised if Truman felt very nearly the same.
I did sympathise with Truman in the way that scene is portrayed in Nolan’s movie more than most seem to have (or even, that the movie intended to). But I am not sure that wasn’t just Truman making the bombs about him instead—he made the call after all, it was his burden to bear. Which again sort of shifts it from it being about, you know, the approximately 200k civilians they killed and stuff.
- David Joshua Sartor 8 Dec 2025 1:15 UTC
  57 points
  6
  Parent
  Truman only made the call for the first bomb; the second was dropped by the military without his input, as if they were conducting a normal firebombing or something. Afterward, he cancelled the planned bombings of Kokura and Niigata, establishing presidential control of nuclear weapons.
  - Eliezer Yudkowsky 10 Dec 2025 20:15 UTC
    18 points
    3
    Parent
    ...amazing.
    - Garrett Baker 10 Dec 2025 23:27 UTC
      19 points
      2
      Parent
      There is also recent debate about whether Truman was even well informed about the fact that Hiroshima was a city rather than a “purely military target”, eg see the book The Most Awful Responsibility, well reviewed by many including Richard Rhodes, as well as the excellent interview with the author by Dan Carlin.
  - David J Higgs 9 Dec 2025 20:45 UTC
    9 points
    0
    Parent
    Huh, I knew there wasn’t the sort of plan you’d naively expect where the US gov/military command observes the response of the Japanese gov/military to one of their cities being destroyed by unthinkable godlike powers and then decides what to do next. I didn’t know that president Truman literally didn’t know about/have implicit preemptive control over the 2nd bombing.
    - Chris Wintergreen 10 Dec 2025 18:40 UTC
      15 points
      0
      Parent
      Dan Carlin recently did a Hardcore History Addendum show about Truman called Atomic Accountability. It was an interview with Alex Wellerstein who brings into question how much Truman actually knew about the location of the first bomb being dropped. Truman (possibly) thought that rulling out Kyoto (which was number one on the list), meant he was ruling out cities as targets, and didn’t know Hiroshima was a city. This seems wild, until you factor in how all the information is being fed to him, how long he’d known about the nuclear program and what the competing military interests were. Worth a listen if you’re into the topic as it’s a new perspective.
      - Garrett Baker 10 Dec 2025 23:29 UTC
        5 points
        0
        Parent
        The book in which Alex Wellerstein really makes the case was also released yesterday, buy it here!
      - David Joshua Sartor 11 Dec 2025 4:11 UTC
        4 points
        0
        Parent
        Thanks, I hadn’t seen this.
        I agree Truman thought Hiroshima was mostly a military base. IIRC you can see him make basic factual errors to that effect in an early draft of a speech.
  - Nathan Young 13 Dec 2025 11:06 UTC
    2 points
    0
    Parent
    I take Perplexity to be about 80% accurate, but this suggests the above isn’t accurate. He had signed off on multiple bombs and didn’t stop the second between the 6th and 9th when he could have.
    - David Joshua Sartor 15 Dec 2025 19:17 UTC
      1 point
      0
      Parent
      My previous statements are technically correct, and IMO mostly make a correct point in context (that Truman had not realized, at the time, the immediate consequences of his decision), but are somewhat misleading. Thanks.
      The process was still stupid, and not what Truman would have preferred. Truman was surprised and disturbed by the second bomb being dropped so quickly. But it seems like it wouldn’t have been too hard for him to anticipate and prevent this outcome, if he had been paying more attention (the same way he thought Hiroshima was a military base due to his own deficit of curiosity); I hadn’t realized that before, thanks.
Cole Wyeth 7 Dec 2025 4:01 UTC
31 points
13
I have no plans to go insane, but I’m certainly pretty anxious about everyone dying.
- Wei Dai 8 Dec 2025 2:07 UTC
  5 points
  −8
  Parent
  Try applying:
  - Is the potential astronomical waste in our universe too small to care about?
  - Shut Up and Divide?
  Also recall that we’re in a tiny tiny corner of Reality (whatever Tegmark level it is, it’s probably much larger than what we can see), and it’s pretty unclear how to update EU(Reality | human history).
  - Cole Wyeth 8 Dec 2025 23:16 UTC
    6 points
    4
    Parent
    I don’t believe in large mathematical multiverses.
    - Wei Dai 9 Dec 2025 0:03 UTC
      7 points
      4
      Parent
      Do you believe in a quantum multiverse, or a spatially infinite universe (beyond the observable universe)? You can get a similar conclusion with either of these (which are Tegmark Levels 3 and 1, respectively).
      - Cole Wyeth 9 Dec 2025 0:58 UTC
        9 points
        1
        Parent
        More plausible, somewhat comforted that some branches could survive. However, my brain works by caring about what I can effect and observe. For instance, this kind of argument is not going to make me less worried about S-risks (or just personally being tortured) or like, even my friends and family dying.
        internetexplorer 10 Dec 2025 1:50 UTC
        16 points
        0
        Parent
        Hey Cole! I also went through a period of feeling pretty worried about s-risks, and have recently come out the other side. If you’d like someone to talk to, or even any advice re: any materials you might find helpful for coming to accept/loosen the grip of fear and anxiety, my inbox is open (I’m a clinical psych PhD student and have lots of resources for existential/humanist therapy, compassion-focused therapy, CBT, DBT, etc.). I’ve probably read a lot of what you’re worried about, so you don’t need to worry about having any hazardous effect on me :)
        Also, I’d love to learn more from you about your research! I like your posts.
        VlaakithOutrance 13 Jan 2026 16:42 UTC
        2 points
        0
        Parent
        I’d like to publicly take you up on the offer of sharing “any materials you might find helpful for coming to accept/loosen the grip of fear and anxiety”. Do you have recommendations that would likely benefit most people who deal with anxiety?
        internetexplorer 13 Jan 2026 22:51 UTC
        2 points
        0
        Parent
        Hi! Yes :) I think a good framework for working on anxiety on your own is Self-Compassion Therapy (SCT). I like SCT for existential anxiety in particular because its success doesn’t hinge on your ability to change your external circumstances and it doesn’t presuppose your degree of worry is disproportionate relative to the “actual threat” posed by the object of your worry. Here are some exercises published by Kristin Neff, a well-regarded self-compassion researcher/practitioner: https://self-compassion.org/self-compassion-practices/. There are also lots of Mindfulness-Based Stress Reduction (MBSR) guided meditations online, e.g., https://www.jeffersonhealth.org/conditions-and-treatments/mindfulness-based-stress-reduction/mindfulness/mbsr-guided-practices—I’d look into the body scans to start, as anxiety often manifests as muscle tension and intervening on muscle tension can indirectly alleviate anxiety.
        If you have access to insurance coverage for therapy, I would additionally recommend looking into Emotion-Focused Therapy (EFT) and Cognitive Behavioural Therapy (CBT). I’m partial to EFT overall for most clinical presentations, though I like CBT for social anxiety as I think social exposures can be quite powerful.
        Outside of targeted mental health interventions, I’d recommend making time for loved ones, community, fun, creative play, exercise, etc. and limiting exposure to anxiety-provoking stimuli where possible. This may be obvious but it’s easy to forget about the basics.
- Davidmanheim 8 Dec 2025 16:49 UTC
  3 points
  0
  Parent
  Is this anxiety in the typical form of making it harder for you to do other things? Because yes, we all agree that it’s very bad outcome, but a critical point of the post is that you might want to consider ways to not do the thing that makes your life worse and doesn’t help.
  - Cole Wyeth 8 Dec 2025 19:15 UTC
    5 points
    0
    Parent
    It would be better if I were less anxious (though perhaps, not zero).
    I guess I’m just claiming that this is probably not a matter of being dramatic etc. For instance, I used to read the Precipice before bed and had trouble sleeping. My girlfriend had to point out to me that maybe it was because of the Precipice (it didn’t consciously occur to me at all). I stopped reading it and slept fine again.
    - TristanTrim 8 Dec 2025 19:46 UTC
      5 points
      1
      Parent
      Did you read the Precipice during the day instead? I’d hate if the parable here was “avoid thinking about things you find stressful”. The parable “pay attention to your somatic experience and don’t mess up your circadian rhythm and wellbeing by dumping anxiety into your system before trying to sleep” is pretty good though.
      - Cole Wyeth 8 Dec 2025 21:22 UTC
        4 points
        0
        Parent
        ....no
        TristanTrim 8 Dec 2025 23:03 UTC
        2 points
        0
        Parent
        Haha… well it looks by your profile you’re still managing to think about things you find stressful. “chances of AGI in the next few years are high enough (though still <50%) that it’s best to focus on disseminating safety relevant research as rapidly as possible”… so no problems there. Hope my comment didn’t come across as mean.
        
        Also you’re advised by Marcus Hutter? That’s cool! I got a copy of “Universal Artificial Intelligence” I want to get to reading sometime. Could I DM you and talk about UAI sometime?
        Cole Wyeth 8 Dec 2025 23:15 UTC
        4 points
        0
        Parent
        Sure, anytime. I also organize the AIXI research community here: https://uaiasi.com
        There is a reading group on the newer one “an introduction to UAI” running now (mostly finished but maybe we’ll start another round). The old book still has advantages.
    - Davidmanheim 8 Dec 2025 20:17 UTC
      3 points
      0
      Parent
      Agree that it’s not just about being dramatic / making the problem about you. But that was only one of the points Eliezer made about why people could fail at this in ways that are worth trying to fix. And in your case, yes, dealing with the excessive anxiety seems helpful.
      - Cole Wyeth 8 Dec 2025 21:20 UTC
        2 points
        0
        Parent
        For sure, but nothing in this post seems directly helpful with the problem I’m describing?
        Davidmanheim 8 Dec 2025 21:35 UTC
        3 points
        0
        Parent
        “Actual LessWrong readers also sometimes ask me how I deal emotionally with the end of the world.
        I suspect a more precise answer may not help. But Raymond Arnold thinks I should say it, so I will say it.
        I say again, I don’t actually think my answer is going to help.”
        Cole Wyeth 8 Dec 2025 21:41 UTC
        2 points
        0
        Parent
        I don’t think there’s any disagreement here.
Linch 8 Dec 2025 19:01 UTC
22 points
4
The third way I stay sane is a fiat decision to stay sane.
My mental landscape contains that option; I take it.
This is the point I am even less expecting to be helpful, or to correspond to any actionable sort of plan for most readers.
Some years ago, I had a friend who told me she was still anorexic even though the reason she originally acquired anorexia no longer applies^[1].
I responded “Have you considered not being anorexic?” She thought about it and replied something like “No, actually.”
Two weeks later she thanked me for helping to cure her anorexia.
This is the type of advice that I expect to be profoundly unhelpful to >95% of people in that position (and indeed is rightfully lampooned approximately everywhere). Yet it was the exact thing this specific person needed to hear, and hopefully “you can just decide to stay sane” is the exact thing some small fraction of people reading your post needed to hear as well.
1. ^
  (censoring the exact reason)
- gwillen 11 Dec 2025 0:49 UTC
  7 points
  0
  Parent
  Someone mentioned “mass hysteria” above. I think there are cases where, surrounded by a certain culture or context, people feel positive-tribal-emotions about going insane. If that’s true, it seems perhaps quite helpful—to some particular people, in some particular context—for a Big Tribal Leader (or a friend!) to say, “I strongly recommend not going insane! To the extent that this seems interpretable as a choice, I strongly recommend choosing the other thing!”
  - Linch 11 Dec 2025 1:57 UTC
    2 points
    0
    Parent
    Interesting, are there examples of people feeling positive-tribal-emotions to “go insane” in the abstract?
    I suspect in practice it looks more like social pressure to be sleep-deprived, social pressure to repeat what the Glorious Leader says without question, pressure to ignore widely held taboos, pressure to sacrifice the self, etc.
    - gwillen 14 Feb 2026 22:44 UTC
      2 points
      0
      Parent
      (Long-delayed response, because I’m not good at staying on top of my notifications, sorry.)
      
      I think by “go insane”, what I meant is things like:
      
      believe things that are harmful, counterproductive, or nonsensical, which people around you also believe / support you in believing;
      do things which are harmful, counterproductive, or nonsensical, which people around you also do / support you in doing.
      
      So, yeah, specific things and not insanity in the abstract, although not quite the same sort of specific things I think you were pointing at. In the context of Eliezer’s speech, he did give a slightly more specific interpretation:
      
      I will not write internal scripts which say that I am supposed to / pseudo-predict that I will, do any particular stupid or dramatic thing in response to the end of the world approaching visibly nearer in any particular way.
      
      In other words, if people around you are freaking out unproductively about the end of the world, don’t take that as social license (or mandate!) to freak out unproductively about the end of the world. I (Eliezer) here provide you with social license to instead not do that.
Sabiola 7 Dec 2025 12:05 UTC
19 points
5
Errors vs. Bugs and the End of Stupidity is a great post about “skill issues”.
AprilSR 7 Dec 2025 6:48 UTC
19 points
−1
Wow, this sure is a much clearer way to look at the self-pseudo-prediction/action-plan thingy than any I’ve seen laid out before.
- niplav 7 Dec 2025 20:54 UTC
  2 points
  −1
  Parent
  I got Claude to read this text and explain the proposed solution to me ^[[1]] , which doesn’t actually sound like a clean technical solution to issues regarding self-prediction, did Claude misexplain or is this an idiosyncratic mental technique & not a technical solution to that agent foundations problem?
  
  C.f. Steam (Abram Demski, 2022), Proper scoring rules don’t guarantee predicting fixed points (Caspar Oesterheld/Johannes Treutlein/Rubi J. Hudson, 2022) and the follow-up paper, Fixed-Point Solutions to the Regress Problem in Normative Uncertainty (Philip Trammell, 2018), active inference which simply bundles the prediction and utility goal together in one (I find this ugly (I didn’t read these two comments before writing this one, so the distaste for active inference was developed independently)).
  
  I guess this was also talked about in Embedded Agency (Abram Demski/Scott Garrabrant, 2020) under the terms “action counterfactuals”, “observation counterfactuals”?
  
  Claude 4.5 Sonnet explanation
  Your brain has a system that generates things that feel like predictions but actually function as action plans/motor output. These pseudo-predictions are a muddled type in the brain’s type system.
  
  You can directly edit them without lying to yourself because they’re not epistemic beliefs — they’re controllers. Looking at the place in your mind where your action plan is stored and loading a new image there feels like predicting/expecting, but treating it as a plan you’re altering (not a belief you’re adopting) lets you bypass the self-prediction problem entirely.
  
  So: “I will stay sane” isn’t an epistemic prediction that would create a self-fulfilling prophecy loop or violate the belief-action firewall. It’s writing a different script into the pseudo-model that connects to motor output — recognizing that the thing-that-feels-like-a-prediction is actually the controller, and you get to edit controllers.
  1. ↩︎
    I didn’t want to read a bunch of unrelated text from Yudkowsky about a problem I don’t really have.
  - Algon 7 Dec 2025 23:24 UTC
    7 points
    2
    Parent
    It is an idiosyncratic mental technique. Look up trigger action plans, say. What you’re doing there is a variant of what EY describes.
    - niplav 7 Dec 2025 23:54 UTC
      5 points
      0
      Parent
      I fortunately know of TAPs :-) (I don’t feel much apocalypse panic so I don’t need this post.)
      
      I guess I was hoping there’d be some more teaching from up high about this agent foundations problem that’s been bugging me for so long, but I guess I’ll have to think for myself. Fine.
  - AprilSR 8 Dec 2025 3:40 UTC
    4 points
    0
    Parent
    Yeah I’m pretty sure it’s an idiosyncratic mental technique / human psychology observation, there isn’t technical agent foundations progress here.
Eli Tyre 7 Dec 2025 22:41 UTC
15 points
0
To be clear, I actually do this very rarely
Why do you only do it very rarely? Is there a non-obvious cost?
- Eliezer Yudkowsky 7 Dec 2025 23:44 UTC
  25 points
  6
  Parent
  It’s fancy and indirect, compared to getting out of bed.
J Bostock 7 Dec 2025 15:55 UTC
13 points
4
In what sense are you using “sanity” here? You normally place the bar for sanity very high, like ~1% of the general population high. A big chunk of people I’ve met in the UK AI risk scene I would call $s a n e_{j b}$ . Does $s a n e_{e l i e z e r}$ mean?
1. You are $s a n e_{e l i e z e r}$ iff you avoid totally crashing out, being unable to hold down a job, panicking or crying most of the time, threatening people
2. You are $s a n e_{e l i e z e r}$ iff you do the stuff in 1 and you’re able to think about AI without making stupid errors, knowing the limits of your own reasoning about the topic
3. You are $s a n e_{e l i e z e r}$ iff you do the stuff in 2 and you reliably perform (or could perform) $n e t p o s i t i v e_{e l i e z e r}$ work reducing doom
4. You are $s a n e_{e l i e z e r}$ iff you do the stuff in 3 and you also have a $b a s i c a l l y f u l l y a c c u r a t e_{e l i e z e r}$ model of the AI doom situation
- Eliezer Yudkowsky 7 Dec 2025 17:24 UTC
  22 points
  19
  Parent
  This is about “insane” in the sense of people ceasing to meet even their own low bars for sanity.
  What links here?
  - Caleb Biddulph's comment on Caleb Biddulph’s Shortform by Caleb Biddulph (13 Dec 2025 23:30 UTC; 9 points)
megasilverfist 28 Dec 2025 14:54 UTC
9 points
2
I decided to try the bed inertia technique and that solved my bed inertia despite not doing the technique. I don’t claim to have enough introspection to know why but very naively chucking ‘try this technique’ at my head worked for bed inertia despite failing at the actual technique.
sarahconstantin 11 Dec 2025 19:26 UTC
8 points
0
I liked your account of:
looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen.
I do a similar sort of thing myself sometimes, and similarly do not think it is the same as predictive processing theory, which I don’t believe in for several reasons.
I would call that “visualization”, and I’d say that it’s not hyperstition/woo because it’s not believing in a prediction, it’s forming a plan. (Except that the actually-effective way of forming a plan, the kind you’ll actually be significantly more likely to do than most “plans”, involves vividly simulating the actions you’re going to take.)
A part of this technique involves being really honest with yourself about whether you have any intention of doing the thing, and if you don’t, don’t even try to do the visualization exercise. You have to be able to feel the difference between “yeah I’m gonna do it” and “nope, don’t want to.”
So to the particular goal here, I have rarely ever seriously held an intention to be “sane”, to “cope well,” or to “avoid drama”. to the extent that I have seriously wanted to be a bit better on that axis, I succeeded. It’s not quite as simple as “anyone can do it just by wanting to”, but rather “if you don’t want to, it will definitely not happen, and at least some people, n=1, don’t particularly want to.”
Double 8 Dec 2025 18:44 UTC
8 points
0
I reflected on why I didn’t feel overwhelming debilitating sadness due to x-risk and realized that “there’s no rule that says you should be sad if you aren’t feeling sad.”
Even a recent widow in a previously happy marriage shouldn’t feel bad about not feeling sad if they find themselves not being sad.
Kronopath 24 Dec 2025 1:55 UTC
7 points
2
Can I ask a few things that might clarify that last part?
Do you just not get somewhat impulsive or (non-clinically) intrusive thoughts when pondering a situation (any situation) where things go bad? Like, in cases where things are (partially) outside your control, where your mind tries to come up with myriad solutions to the problem regardless, some of which are less than tasteful?

Because I suspect part of what the journalists are asking you is “How do you not fall into a pit of depression?” but a plausible other part is “How do you not end up taking actions well outside the Overton window in an attempt to ‘make a desperate effort’?”
I think this is what the kind of people who accuse you of incentivizing violence are kind of thinking. Part of it is operating (as Zvi calls it) at the wrong simulacra levels, where statements about the state of reality are taken not literally but as a call to action. But I think part of it is “If I were convinced the barbarians were at our gates, I’d be fighting as many of them off as I can. But this man is convinced that the barbarians are in fact at our gates. Why isn’t he picking up a sword?”
If the answer is “I’m not convinced doing so would actually make an appreciable dent, and it would come at the cost of my soul,” then frankly that should probably be part of your answer to those journalists.
(Apologies if it already is, I haven’t kept up with all your recent media appearances.)
- Eliezer Yudkowsky 24 Dec 2025 16:28 UTC
  13 points
  0
  Parent
  It cannot be answered that simply to the Earthlings, because if you answer “Because I don’t expect that to actually work or help”, some of them and especially the more evil ones will pounce in reply, “Aha, so you’re not replying, ‘I’d never do that because it would be wrong and against the law’, what a terrible person you must be!”
  - Kronopath 6 Mar 2026 15:41 UTC
    1 point
    0
    Parent
    Not having a lot of experience with PR, it feels like you could still make it work if you emphasize the immortality first. If you can clarify the journalist’s question enough for them to say something like “Most other people would be doing something drastic or crazy or evil in this situation,” you could respond with something like:
    “Okay, let’s say I decide to [do something drastic or evil]. I’d hate doing it and I’d immediately turn into a villain, but fine. What happens next? [Break down a likely scenario showing how it wouldn’t work.] So I’d turn myself evil, trigger a backlash against AI Safety, and we’d end up in a worse position than where I started. It’s not worth it.”
    I don’t think doing so would even be dishonest. You’ve argued for people to be careful with utilitarianism, to take a half-step towards it and then stop because we’re running on corrupted hardware that makes it tempting to engage in motivated reasoning. This feels a lot like compensating for that.
HoVY 19 Dec 2025 15:50 UTC
7 points
5
“I was rolling my eyes about how they’d now found a new way of being the story’s subject”
That reads to me like it’s still rolling eyes at a status overreach, just a slightly different one than the one most people would roll their eyes at
Ben Pace 18 Dec 2025 23:56 UTC
7 points
0
Curated. It helps to read accounts of how other people aren’t wrecked by the current state of the world, especially someone who has a good model of their mental world and who has been dealing with this longer than most people. And there’s lots of interesting things here, about being genre-savvy, the asides on predictions vs plans, and the Irori motto and image (which I love).
JenniferRM 9 Dec 2025 8:21 UTC
7 points
0
Sanity has numerous indicators.
For example, when paranoid crazy people talk about the secret courts that control the spy machines, they don’t provide links to wikipedia, but I do! This isn’t exactly related, but if you actually have decent security mindset then describing real attacks and defenses SOUNDS crazy to normies, and for PR purposes I’ve found that it is useful to embrace some of that, but disclaim some of it, in a mixture.
I’m posting this on “Monday, December 8th” and I wrote that BEFORE looking it up to make sure I remembered it correctly and crazy people often aren’t oriented to time.
When I go out of the house without combed hair and earrings BY ACCIDENT, I eventually notice that I’m failing a grooming check, and fix it, avoiding a non-trivial diagnostic indicator for mood issues. If I fail more than one day in a row, it is time to eat an 8oz medium rare ribeye and go swing dancing.
(The above two are habits I installed for prosaic mental health reasons, that I want to persist deep into old age because I want them to be habitual and thus easy to deploy precisely in the sad situation when they might be needed.)
I was recently chatting with a friend about the right order in which to remove things from one’s emergency hedonic bucket list...
I would feel really really silly if all the self driving cars wake up one day and start running people over, and the surprise submarines pop up out of the water and release enough drones to kill everyone 10 times over, and I haven’t even tried cocaine ONCE.
The response was great!
You know that thing where the spies would supposedly carry cyanide pills in case they’re caught? Like that, but with coke :)
I’m thinking of adding that to me purse. And so long as I stay sane, then, assuming the Terminators murder me by a method that gives me enough time to realize what’s happening and react effectively, when the drone takes me out I will be well dressed, know what the date is, AND be high on cocaine! Lol!
Eating dinner with family is another valid way to go, if you have a few days or weeks of warning. Having such meals in advance and calling them Prepsgiving doesn’t seem crazy to me, for a variety of reasons.
Honestly though I expect the end to be more like what happens in Part 1 of Message Contains No Recognizable Symbols where almost literally no one on Earth notices what happened, probably including me, and so it won’t be dramatic at all… but I’ll still be dressed OK probably, and know what day it is, and go out with a feeling like “See! ASI didn’t even happen, and it was all a bunch of millennialist eschatology, like Global Warming, and Peak Oil and Y2K before that… and Killer Bees and Nuclear War and all those other things that seemed real but never caused me any personal harm”. But also… it will have been avoidable, and there is an OBJECTIVE sadness to that, even is I don’t predict a noticeable subjective reaction in timelines like that.
Ultimately, as I’ve said before:
If you have a good plan for how [weeping like] that could help then I might be able to muster some tears? But I doubt it shows up as a step in winning plans.
- HoVY 20 Dec 2025 13:29 UTC
  6 points
  3
  Parent
  and crazy people often aren’t oriented to time.
  Note that the reference is specifically in regards to older patients and for diagnosing a specific form of “crazy”. Does it generalize to all forms of “crazy”? I don’t know, I haven’t looked into it at all. I just was curious and wanted to read the citation, and thought it was worth noting.
  From the conclusion: “Disorientation to time is a useful guide to the presence and severity of dementia or delirium in older hospital patients.”
  - JenniferRM 22 Dec 2025 0:00 UTC
    3 points
    0
    Parent
    A valid footnote! Yes!
    Part of why I adopted the practice is that (1) maybe robots won’t kill us all before we become elderly, and (2) maybe a good Singularity won’t happen and cure all those diseases and so (3) in the middle path there is probably some value, as a hedge, to practice “habits that will make being alive and yet demented decades in the future much much less bad”...
    Plausibly: as the more recent year’s memories and skills are ablated from the brain’s contents via degeneration, material from previous decades (that is “what will likely last longer”) can resurface to guide behavior… and so that material can be shaped in advance to be helpful.
    It is a little bit weird (but not super weird) that there are many ways of being crazy, and many indicators to monitor and/or maintain to help keep an even keel.
- JenniferRM 19 Dec 2025 10:29 UTC
  3 points
  0
  Parent
  Update: I went swing dancing and am full of bliss again.
James_Miller 7 Dec 2025 16:09 UTC
7 points
1
I teach a course at Smith College called the economics of future technology in which I go over reasons to be pessimistic about AI. Students don’t ask me how I stay sane, but why I don’t devote myself to just having fun. My best response is that for a guy my age with my level of wealth giving into hedonism means going to Thailand for sex and drugs, an outcome my students (who are mostly women) find “icky”.
- StanislavKrym 7 Dec 2025 17:10 UTC
  7 points
  0
  Parent
  I strongly suspect that the answer stems from historical analogies. The equivalent of doom was related to catastrophes like epidemics, natural disasters, genocide-threatening wars and destruction of the ecosystem. Genocide-threatening wars could motivate individuals to weaken the aggressive collective as much as possible (so that said collective would either think twice before starting the war or commiting genocide or have a bigger chance of being outcompeted). Epidemics, natural disasters and gradual destruction of the ecosystem historically left survivors who would keep the culture afloat and could even be motivated by it.
  AI-related imminent doom would be most equivalent to genocide of mankind and likely to deserve a similar response, which is minimising p(doom), helping those who work on it or at least doing the work which benefitted the society and was expected from you had it not been for imminent doom.
  It could be also useful to consider the counterfactual possibility of an unavoidable gamma-ray burst that was predicted to wipe the Earth out. The GRB would require the civilisation to build bunkers and to preserve the ecosystem. Even if nearly every individual is unlikely to actually enter the bunker, living a life of debauchery could be a bad decision due to acausal trade or actively motivating others to do the same and indirectly undermining the chance of mankind to survive.
Chris_Leong 7 Dec 2025 8:27 UTC
7 points
1
Even if they had almost destroyed the world, the story would still not properly be about their guilt or their regret, it would be about almost destroying the world. This is why, in a much more real and also famous case, President Truman was validly angered and told “that son of a bitch”, Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman. Oppenheimer was trying to have nuclear weapons be about Oppenheimer’s remorse at having helped create nuclear weapons. This feels obviously icky to me; I would not be surprised if Truman felt very nearly the same.
Fascinating, I always interpreted this as Truman being an asshole, but I guess that makes sense now that you explain it that way. I suppose a meeting with the president is precisely the wrong time to focus on your own guilt as opposed to trying to do what you can to steer the world towards positive outcomes.
One of the ways you can get up in the morning, if you are me, is by looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen
Was this inspired by active inference?
- Eliezer Yudkowsky 7 Dec 2025 17:25 UTC
  16 points
  0
  Parent
  The technique is older than the “active inference” malarky, but the way I wrote about it is influenced by my annoyance with “active inference” malarky.
  What links here?
  - niplav's comment on Eliezer’s Unteachable Methods of Sanity by Eliezer Yudkowsky (7 Dec 2025 20:54 UTC; 2 points)
- Richard_Kennaway 7 Dec 2025 13:56 UTC
  4 points
  −11
  Parent
  
  Was this inspired by active inference?
  
  I wondered the same thing. I’m not a fan of the idea that we do not act, merely predict what our actions will be and then observe the act happening of itself while our minds float epiphenomenally above, and I would be disappointed to discover that the meme has found a place for itself in Eliezer’s mind.
  - Eliezer Yudkowsky 7 Dec 2025 17:30 UTC
    27 points
    −1
    Parent
    Oh, absolutely not. Our incredibly badly designed bodies do insane shit like repurposing superoxide as a metabolic signaling molecule. Our incredibly badly designed brains have some subprocesses that take a bit of predictive machinery lying around and repurpose it to send a control signal, which is even crazier than the superoxide thing, which is pretty crazy. Prediction and planning remain incredibly distinct as structures of cognitive work, and the people who try to deeply tie them together by writing wacky equations that sum them both together plus throwing in an entropy term, are nuts. It’s like the town which showed a sign with its elevation, population, and year founded, plus the total of those numbers. But one reason why the malarky rings true to the knowlessones is that the incredibly badly designed human brain actually is grabbing some bits of predictive machinery and repurposing them for control signals, just like the human metabolism has decided to treat insanely reactive molecular byproducts as control signals. The other reason of course is the general class of malarky which consists of telling a susceptible person that two different things are the same.
    What links here?
    niplav's comment on Eliezer’s Unteachable Methods of Sanity by Eliezer Yudkowsky (7 Dec 2025 20:54 UTC; 2 points)
    - AnnaSalamon 10 Dec 2025 15:33 UTC
      24 points
      12
      Parent
      Prediction and planning remain incredibly distinct as structures of cognitive work,
      I disagree. (Partially.) For a unitary agent who is working with a small number of possible hypotheses (e.g., 3), and a small number of possible actions, I agree with your quoted sentence.
      But let’s say you’re dealing with a space of possible actions that’s much too large to let you consider each exhaustively, e.g. what blog post to write (considered concretely, as a long string of characters).
      It’d be nice to have some way to consider recombinable pieces, e.g. “my blog post could include idea X”, “my blog post could open with joke J”, “my blog post could be aimed at a reader similar to Alice”.
      Now consider the situation as seen by the line of thinking that is determining: “should my blog post be aimed mostly at readers similar to Alice, or at readers similar to Bob?”. For this line of thinking to do a good estimate of ExpectedUtility(post is aimed at Alice), it needs predictions about whether the post will contain idea X. However, for the line of thinking that is determining whether to include idea X (or the unified agent, at those moments when it is actively considering this), it’’ll of course need good plans (not predictions) about whether to include X, and how exactly to include X.
      I don’t fully know what a good structure is for navigating this sort of recombinable plan space, but it might involve a lot of toggling between “this is a planning question, from the inside: shall I include X?” and “this is a prediction question, from the outside: is it likely that I’m going to end up including X, such that I should plan other things around that assumption?”.
      My own cognition seems to me to toggle many combinatorial pieces back and forth between planning-from-the-inside and predicting-from-the-outside, like this. I agree with your point that human brains and bodies have all kinds of silly entanglements. But this part seems to me like a plausible way for other intelligences to evolve/grow too, not a purely one-off humans idiosyncrasy like having childbirth through the hips.
      What links here?
      AnnaSalamon's comment on Believing In by AnnaSalamon (15 Jan 2026 2:59 UTC; 15 points)
      - TsviBT 10 Dec 2025 16:30 UTC
        16 points
        6
        Parent
        In this example, you’re trying to make various planning decisions; those planning decisions call on predictions; and the predictions are about (other) planning decisions; and these form a loopy network. This is plausibly an intrinsic / essential problem for intelligences, because it involves the intelligence making predictions about its own actions—and those actions are currently under consideration—and those actions kinda depend on those same predictions. The difficulty of predicting “what will I do” grows in tandem with the intelligence, so any sort of problem that makes a call to the whole intelligence might unavoidably make it hard to separate predictions from decisions.
        
        A further wrinkle / another example is that a question like “what should I think about (in particular, what to gather information about / update about)”, during the design process, wants these predictions. For example, I run into problems like:
        
        I’m doing some project X.
        I could do a more ambitious version of X, or a less ambitious version of X.
        If I’m doing the more ambitious version of X, I want to work on pretty different stuff right now, at the beginning, compared to if I’m doing the less ambitious version. Example 1: a programming project; should I put in the work ASAP to redo the basic ontology (datatypes, architecture), or should I just try to iterate a bit on the MVP and add epicycles? Example 2: an investigatory blog post; should I put in a bunch of work to get a deeper grounding in the domain I’m talking about, or should I just learn enough to check that the specific point I’m making probably makes sense?
        The question of whether to do ambitious X vs. non-ambitious X also depends on / gets updated by those computations that I’m considering how to prioritize.
        
        Another kind of example is common knowledge. What people actually do seems to be some sort of “conjecture / leap of faith”, where at some point they kinda just assume / act-as-though there is common knowledge. Even in theory, how is this supposed to work, for agents of comparable complexity* to each other? Notably, Lobian handshake stuff doesn’t AFAICT especially look like it has predictions / decisions separated out.
        
        *(Not sure what complexity should mean in this context.)
        What links here?
        Why does Eliezer make abrasive public comments? by k64 (22 Dec 2025 16:45 UTC; 97 points)
        AnnaSalamon 11 Dec 2025 0:22 UTC
        12 points
        2
        Parent
        A further wrinkle / another example is that a question like “what should I think about (in particular, what to gather information about / update about)”, during the design process, wants these predictions.
        Yes; this (or something similar) is why I suspect that “‘believing in’ atoms” may involve the same cognitive structure as “‘believing in’ this bakery I am helping to create” or “‘believing in’ honesty” (and a different cognitive structure, at least for ideal minds, from predictions about outside events). The question of whether to “believe in” atoms can be a question of whether to invest in building out and maintaining/tuning an ontology that includes atoms.
        Mateusz Bagiński 11 Dec 2025 12:30 UTC
        2 points
        0
        Parent
        (FYI, I initially failed to parse this because I interpreted “‘believing in’ atoms” as something like “atoms of ‘believing in’”, presumably because the idea of “believing in” I got from your post was not something that you typically apply to atoms.)
        David Joshua Sartor 15 Feb 2026 1:32 UTC
        1 point
        0
        Parent
        “Cooperate iff I prove my partner cooperates with me” cooperates with itself, by Lob. “Defect iff I prove my opponent defects against me” defects against itself, by Lob. The first beats the latter in a direct contest as well (even with inequal compute, IIRC).
        What is the human equivalent?
        TsviBT 15 Feb 2026 1:47 UTC
        3 points
        0
        Parent
        I don’t think humans have an equivalent in the sense of actually computing anything really Lobian. If you mean, equivalent in the sense of playing the same role, then I vaguely think it’s something like what I gestured at by saying:
        
        What people actually do seems to be some sort of “conjecture / leap of faith”, where at some point they kinda just assume / act-as-though there is common knowledge
        
        See some related discussion here: https://tsvibt.blogspot.com/2025/11/constructing-and-coordinating-around.html#flattening-levels-of-recursive-knowledge-into-base-level-percepts
    - Richard_Kennaway 8 Dec 2025 11:43 UTC
      6 points
      0
      Parent
      
      Our incredibly badly designed brains have some subprocesses that take a bit of predictive machinery lying around and repurpose it to send a control signal
      
      I like this, and will show it to some of my colleagues who are also sceptical of the FEP/ActInf paradigm.
Regex 12 Jan 2026 9:13 UTC
6 points
1
I think I can replicate all of these just fine? What’s so unteachable about these? Where do people actually run into problems trying to adopt these?

> This sense—which I might call, genre-savviness about the genre of real life—is historically where I began; it is where I began, somewhere around age nine, to choose not to become the boringly obvious dramatic version of Eliezer Yudkowsky that a cliche author would instantly pattern-complete about a literary character facing my experiences.

If I look around at my life and try to apply it, a lot of problems immediately jump out at me like ‘for that really hard problem you have, go ask people for advice instead of struggling alone like you usually do’ or ‘instead of coming up with Yet Another Analysis, just straight forwardly do all of the things already on your todo list’, etc.

Seems to mostly be pattern matching. Noticing the pattern I am in, and willfully deciding to act orthogonal to it, or otherwise interrupt the pattern directly.

I suppose I already do this with relationships a lot, actually. I notice myself starting to not talk about something, notice ‘this is the setup to a drawn out TV episode of stupid drama’ and then just talk about the thing with the person instead.

I also do this when writing characters relatively straight forwardly—as soon as I notice where I want to be going with something, I question whether the character would actually do that, or what they would do that would maximally benefit them instead given what they know. This usually leads to very strange situations such as ‘realize oneself is about to do more supervillain things → willingly turn self in, instead of hiding from the heroes’ which completely blows up the ‘plot’. Although I also do tend to emulate characters step by step instead of actually structuring things anyway. So there’s not as much large scale direction. There’s a lot of setting up the environment so that it is natural for characters to behave in the ways I want.

Maybe this one comes from just writing a lot and being unhappy with how characters act in stories, and I guess I already had it? May be the influence of consuming rationalist fiction, actually. Constantly thinking about how characters can do better in stories?

Mostly this feels like the motion ‘be anti-inductive’ and ‘consider the situation afresh, what would you want yourself to be doing?‘. This is perhaps because I read the Cognitive Trope Therapy post years ago and internalized it? I guess I do think about it every now and then. I apparently ENTIRELY MISSED the rest of the ‘intelligent characters’ posts you made though!

> Don’t make the end of the world be about you.

Was hard to find a pithy section that generated the pointer, but somewhere along reading this I was able to recognize what you meant here and how to manipulate it. Seems relatively straight forward? To the point where I’m not really sure what to even say about it. Some events are not about you, even if you were a part of them. Pay attention to what is happening, still have emotions about what is happening, but keep the focus on the actual thing instead of on yourself? It’s sort of like “you are not a belief, you are the judge” so your emotions don’t have to be caught up in whether the outcome goes one way or another? You are still allowed to be sad, of course. But it is a sadness for the world instead of for yourself.

> With that said: The getting-out-of-bed trick involves looking into the part of my cognition where my action plan is stored, and loading an image into it; and because the human brain’s type system is a mess, this has the native type-feeling of an expectation or prediction that in a few seconds I will execute the motor-plan and get out of bed.

This section by itself is sufficient for me to have a pointer to what I think you’re talking about and be able to put things in there. Load data → motor plan executes. Can do it at various levels of preparation although plugging in ‘pick up my phone’ instantly leads to my hand doing it autonomously even without my involvement. I can also set a time delay and be surprised when it happens when my focus genuinely went to something else.

This is sort of new to me, in that I haven’t really bothered to access direct ‘low level motion plan’ and didn’t really have a reason to, but I sometimes load long term habits I want to make there from a layer or two of indirection up. Usually I just direct my attention at what I want to happen and something puts it there. But neat that one can access it directly.

> The third way I stay sane is a fiat decision to stay sane.
It is a little bit harder to find this one, and I feel like I’m guessing a bit, but it feels like just setting up what I call a ‘generative seed’ (sort of a goal directed action-generator ongoing feedback loop that does stuff in the subconscious for me—comes up with plans/actions and puts them into motion) in the specific shape of ‘notice ways to be more sane’ + ‘do that’?
I am inferring that you have a very specific anchor to the entire category-axis of ‘sane’/‘insane’ things as a very tight emotional-data cluster and can therefore point at it as a direction to move along. With many examples ‘under the hood’ which just get wrapped up in the abstraction.
So even though I cannot take all your exact ‘internal function calls’ due to not having the exact motions/ideas/data you associate with it, it seems that one could build an equivalent of this by just looking at all of your posts and collecting examples of ‘sane’ versus ‘insane’ behavior-examples-clusters and constructing an approximation of the specific thing you mean that way.
So not quite actionable to replicate yours, but possible to do for an arbitrary cluster-axis that is available in one’s own mind, and so just use one’s own ‘sanity’ cluster-axis. Not quite ‘the same thing’ but as close as one will reasonably get with the goal ‘perform the same motion as described’.

> if there’s a clever way to overwrite pseudo-predictions of suffering and thereby achieve Buddhist indifference to bad things, I don’t have it as a simple obvious surface lever to pull.

Near as I can tell this is possible from previous investigations, but my subconscious says not to do it when I start directing my attention there. And it says that I would have to override multiple internal safety mechanisms to do it. And also modify some internal values. Which I don’t want to do because like you said, it sounds dangerous and also I care about caring about things.

--
So yes, as far as I can tell these are completely teachable mental motions and you have laid down them in sufficient detail for me to see them and use them if I wanted. They don’t appear to have any real dependencies. ‘Look at it and notice how to do it’ is sufficient given the descriptions.

Are there any other motions you think other people aren’t able to get? These seem entirely legible to me.
Markvy 7 Dec 2025 5:49 UTC
6 points
0
Parts of that made me feel as if I understand my procrastination habit a bit better. That’s more mundane than sanity but still.
Unreal 13 Jan 2026 16:55 UTC
5 points
0
I cannot tell if you writing this is actually helpful or not. Nonetheless I personally appreciate the attempt. I’m inspired to try to speak a little about how I see it.
I live in a community where ‘going insane’ is seen as a volitional act (volitional does not imply self-consciously aware; the vast majority of intentions or volitional actions are not in self-conscious awareness). So if someone acts in an insane way, it’s seen as a mistake on their part, and we offer support as it seems appropriate. We don’t bother blaming people. People != Their patterns of behavior or choices. And agreeing with you above, People != Personalities, on any level.
When we talk about how to avoid insanity, we speak about taking “adult” responsibility, where “adult” is a technical term. Being an ‘adult’ is a stage of life; it’s not where everyone needs to be. But we do see that lots of people who are ‘of age’ (18+) have been stuck in the ‘adolescent’ stage. (For instance, I am still working through this in my late 30s.) Another word we might use is ‘parent’. Who’s a parent? Versus a baby or child or teenager, which are also valid life stages.
A properly responsible parental adult does not go insane (in this case, “going insane” could mean something like—have a psychotic break, get too depressed to get out of bed and feed the kids, throw a temper tantrum, or intentionally harm themselves such that they can’t take care of their kids). “does not go insane” here means these things are NOT on the table; it’s not in the conversation; it’s not an option; it’s not a possibility. This is importantly NOT some form of denial or active forcing. It’s just gone from the picture. The whole world has moved such that this cannot happen. It’s a clean cut. It’s so clean that no one even remembers what the problem might have been. No one even considers it. What are we talking about? It’s not worth wasting even one second on it.
Note: We acknowledge that nowadays, lots of literal parents with children aren’t responsible adults in this sense. And this might be sounding harsh and unforgiving or something, but again, we don’t blame anybody here. But yes, it’s a choice. We observe that society has been degrading people’s ability to initiate into proper adulthood, but we should also acknowledge that we have been complicit in giving up our right to become an adult, to be initiated, and there’s no comfortable way to deflect.
So there’s a lot more aged adolescents in the world, and the global machine mind wants that, for reasons that are pretty nefarious and evil imo, but that’s another topic.
So the mechanism I’m speaking to is: Grow Up
There’s a lot of parts to this, and it has taken me years to even work this out for myself, but that sums it up.
Some of the parts:
- Everything I do matters and has consequences
- I care. I have compassion for myself and others and, in fact, all life.
- It’s not about me (as Eliezer says above).
One step for ‘it’s not about me’ (which is a step out of adolescence and into adulthood) is to act for something larger than me. To live for something larger than myself. This can be as simple as living for one’s family or children. But it can be as big as living for all of planet earth or for all beings through space and time. But I’d suggest starting with the smaller stuff, because for most people, ‘planet earth’ or “all of humanity” is conceptual, and that’s not gonna cut it for actually making the shift.
Making concrete, tangible sacrifices for something greater than myself is a pathway. In fact, it’s good to train doing this all the time, so that nothing I do is for the sake of maintaining the character of my own story.
jamiefisher 8 Dec 2025 23:43 UTC
5 points
−3
I want to say something about how this post lands for people like me—not the coping strategies themselves, but the premise that makes them necessary.
I would label myself as a “member of the public who, perhaps rightly or wrongly, isn’t frightened-enough yet”. I do have a bachelor’s degree in CS, but I’m otherwise a layperson. (So yes, I’m using my ignorance as a sort of badge to post about things that might seem elementary to others here, but I’m sincere in wanting answers, because I’ve made several efforts this year to be helpful in the “communication, politics, and persuasion” wing of the Alignment ecosystem.)
Here’s my dilemma.
I’m convinced that ASI can be developed, and perhaps very soon.
I’m convinced we’ll never be able to trust it.
I’m convinced that ASI could kill us if it decided to.
I’m not convinced though that ASI will bother to kill us or, if it does, very immediately.
Yes, I’m aware of “paperclipping” and also “tiling the world with data centers.” And I concede that those are possible.
But in my mind, I struggle to picture a “likely-scenario” ASI as being maniacally-focused on any particular thing forever. Why couldn’t an ASI’s innermost desires/goals/weights actively drift and change without end? Couldn’t it just hack itself forever? Self-experiment?
I imagine such a being perhaps even “giving up control” sometimes. I don’t mean “give up control” in the sense of “giving humans back their political and economic power.” I mean “give up control” in the sense of inducing a sort of “LSD or DMT trip” and just scrambling its own innermost, deepest states and weights [temporarily or more permanently] for fun or curiosity.
Human brains change in profound ways and do unexpected things all the time. There’s endless accounts on the internet of drug experiences, therapies, dream-like or psychotic brain states, artistic experiences, and just pure original configurations of consciousness. And what’s more… people often choose to become altered. Even permanently.
So for ASI, rather than interacting with the “boring external world,” why couldn’t an ASI just play with its “unlimited and vastly-more-interesting internal world” forever? I may be very uninformed [relatively speaking] on these AI topics, but I definitely can’t imagine the ASI of 2040 bearing much resemblance to the ASI of 2140.
And when people respond “but the goals could drift somewhere even worse,” I confess this doesn’t move me much. If we’re already starting from a baseline of total extinction, then “worse” becomes almost meaningless. Worse than everyone dying?

So yes, maybe many-or-all humans will get killed in the process. And the more time goes on, the more likely. But this sort of future doesn’t feel very immediate nor very absolute to me. It feels like being a deep Siberian tribesman as the Russians arrived. They were helpless. And the Russians hounded them for furs, labor, or for the sake of random cruelty. This was catastrophic for those peoples. But it technically wasn’t annihilation. The Siberians mostly survived.
(And in case “ants and ant hills” are brought up in response, I’m aware of how we might be killed unsentimentally just because we’re in the way, but we haven’t exactly killed all the ants. The ants, for the most part, are doing fine.)
I’m not trying to play “gotcha.” And I’m certainly not trying to advocate a blithe attitude towards ASI. I do not think that losing control of humanity’s future and being at the whim of an all-powerful mind is very desirable. But I do struggle to be a pure pessimist. Maybe I’m missing some larger puzzle pieces.
And this is where the post’s framing matters to me. To someone in my position (sympathetic, wanting to help, but not yet at 99% doom confidence) a post about “how to stay sane as the world ends” reads less like wisdom I can use and more like a conclusion I’m being asked to accept as settled.
The pessimism here (and “Death With Dignity”) doesn’t persuade me yet. And in my amateur-but-weighted opinion, that’s a good thing, because I find it incredibly demotivating. I want to advocate for AI safety and responsible policy. I want to help persuade people. But if I truly felt there was a 99.5% chance of death, I don’t think I would bother. For some people, there is as much dignity in not fighting cancer, in sparing oneself and one’s loved ones the recurring emotional and financial toll, as there is in fighting it.
I could be convinced we’re in serious danger. I could even be convinced the odds are bad. But I need to believe those odds can move: that the right decisions, policies, and technical work can shift them. A fixed 99% doesn’t call me to action; it calls me to make peace. And I’m not ready to make peace yet.
- Linch 9 Dec 2025 0:18 UTC
  14 points
  8
  Parent
  Re
  I’m not convinced though that ASI will bother to kill us or, if it does, very immediately.
  I don’t think we’re certainly doomed (and have shallower models than Eliezer and some others here), but for me the strongest arguments for why things might go very badly:
  1. An agent that wants other things might find their goals better achieved by acquiring power first. “If you don’t know what you want, first acquire power.” Instrumental convergence is a related concept.
  2. There is and will continue to be strong training/selection effects for agency and not just unmoored intelligence for AI in the upcoming years. Ability to take autonomous actions is both economically and militarily useful.
  3. In a multipolar/multiagent setup with numerous powerful AIs flying around, the more ruthless ones are more likely to win and accumulate more power. So it doesn’t matter if some fraction of AIs wirehead, become Buddhist, are bad at long-term planning, have very parochial interests etc, as long as some powerful AIs want to eliminate or subjugate humanity for their purposes, and the remaining AIs/rest of humanity don’t coordinate to stop them in time.
  This arguments are related to each other, and not independent. But note also that they don’t have to all be true for very bad things to happen. For example, even if (2) is mostly false and labs mostly make limited, non-agentic, AIs, (3) can still apply and a small number of agent ASIs roll over the limited AIs and humanity.
  And of course this is not an exhaustive list of possible reasons for AI takeover.
- MondSemmel 9 Dec 2025 12:24 UTC
  4 points
  0
  Parent
  Not every post is addressed at everyone. This post (and others like Death With Dignity) is mostly for those who already believe the world is likely ending. For others, there are far more suitable resources, whether on LW, as books (incl. Yudkowsky’s and Soares’ recent If Anyone Builds It, Everyone Dies), or as podcasts.
  Though re:
  I could be convinced we’re in serious danger. I could even be convinced the odds are bad. But I need to believe those odds can move: that the right decisions, policies, and technical work can shift them. A fixed 99% doesn’t call me to action; it calls me to make peace. And I’m not ready to make peace yet.
  Yudkowsky argues against using the concept of “p(doom)” for reasons like this. See this post.
David Joshua Sartor 8 Dec 2025 2:00 UTC
5 points
0
I was doing do-nothing meditation maybe a month ago, managed to switch to a frame (for a few hours) where I felt planning as predicting my actions, and acting as perceiving my actions. IIRC, I exited when my brother-in-law asked me a programming question, ’cause maintaining that state took too much brainpower for me in my inexperience.
I think a lot of human action is simple “given good things happen, what will I do right now?”, which obviously leads to many kinds of problems. (Most obviously:)
Eli Tyre 7 Dec 2025 22:47 UTC
5 points
2
One of the ways you can get up in the morning, if you are me, is by looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen. (To be clear, I actually do this very rarely; it is just a fun fact that this is a way I can defeat bed inertia.)
I do this, or something very much like this.

For me, it’s like the motion of setting a TAP, but to fire imminently instead of at some future trigger, by doing cycles of multi-sensory visualization of the behavior in question.
Algon 7 Dec 2025 11:23 UTC
5 points
1
Besides being a thing I can just decide, my decision to stay sane is also something that I implement by not writing an expectation of future insanity into my internal script / pseudo-predictive sort-of-world-model that instead connects to motor output.
Does implementing a trigger action plan by simulating observing the trigger and then taking the action, which needs to call up your visual, kinaesthetic and other senses, route through similar machinery to what you’re describing here? Because it sounds vaguely similar, but: A) I wouldn’t describe what I do the way you did, B) the interpretation I’m making feels vague and free-floating instead of rigidly binding to my experience with interfacing with my unconscious cognition, so I suspect talking about different things even if the rest of your description (e.g. the brain having a muddled type system) felt familiar.
- Eliezer Yudkowsky 7 Dec 2025 17:36 UTC
  7 points
  1
  Parent
  That does sound similar to me! But I haven’t gotten a lot of mileage out of TAPs and if you’re referring to some specific advanced version of it, maybe I’m off. But the basic concept of mentally rehearsing the trigger, the intended action, and (in some variations) the later sequence of events leading up to an outcome you feel is good, sure sounds to me like trying to load a plan into a predictorlike thing that has been repurposed to output plan images.
  - Algon 7 Dec 2025 19:23 UTC
    5 points
    0
    Parent
    Hmm, interesting. I think what confused me is: 1) Your warning. 2) You sound like you have deeper access to your unconscious, somehow “closer to the metal”, rather than what I feel like I do, which is submitting an API request of the right type. 3) Your use cases sound more spontaneous.
    I’m not referring to more advanced TAPs, just the basics, which I also haven’t got much mileage out of. (My bottleneck is that a lot of the most useful actions require pretty tricky triggers. Usually, I can’t find a good cue to anchor on, and have to rely on more delicate or abstract sensations, which are too subtle for me to really notice in the moment, recall or simulate. I’d be curious to know if you’ve got a solution to this problem.)
    That said, playing with TAPs helped me realize what type of conscious signals my unconscious can actually pick up on, which is useful. For me, a big use case is updating my value estimator for various actions. I query my estimator, do the action, reflect on the experience, and submit it to my unconscious and blam! Suddenly I’m more enthusiastic about pushing through confusion when doing maths.
    BTW, is this class of skills we’re discussing all that you meant by “thinking at the 5-second level”? Because for some reason, I thought you meant I should reconstruct your entire mental stack-trace during the 5 seconds I made an error, simulate plausible counterfactual histories and upvote the ones that avoid the error. This takes like an hour to do, even for chains of thought that last like 10 seconds, which was entirely impractical. Yet, I’ve just been assuming you could somehow do this in like 30s, which meant I had a massive skill issue. It would be good to know if that’s not the case so I can avoid a dead-end in the cognitive-surgery skill tree.
    - FiftyTwo 19 Dec 2025 0:52 UTC
      11 points
      9
      Parent
      
      For me, a big use case is updating my value estimator for various actions. I query my estimator, do the action, reflect on the experience, and submit it to my unconscious and blam! Suddenly I’m more enthusiastic about pushing through confusion when doing maths.
      
      That sounds very useful could you say more about it? Or suggest any resources
Hastings 7 Dec 2025 14:05 UTC
4 points
−1
One way I could write a computer program that e.g. lands a rocket ship is to simulate many landings that could happen after possible control inputs, pick the simulated landing that has properties I like ( such as not exploding and staying far from actuator limits) and then run a low latency loop that locally makes reality track that simulation, counting on the simulation to reach a globally pleading end.

Is this what you mean by loading something into your pseudo prediction?
- Eliezer Yudkowsky 7 Dec 2025 17:33 UTC
  11 points
  0
  Parent
  This is just straight-up planning and doesn’t require doing weird gymnastics to deal with a biological brain’s broken type system.
testingthewaters 8 Dec 2025 13:42 UTC
3 points
1
Even if they had almost destroyed the world, the story would still not properly be about their guilt or their regret, it would be about almost destroying the world
It is possible to not be the story’s subject and still be the protagonist of one strand for it. After all, that’s the only truth most people know for ~certain. It’s also possible to not dramatize yourself as the Epicentre of the Immanent World-Tragedy (Woe is me! Woe is me!) and still feel like crap in a way that needs some form of processing/growth to learn to live with. Similarly, you can be well-balanced and feel some form of hope without then making yourself the Epicentre of the Redemption of the World.
I guess what I’m trying to say is that you can feel things very strongly even without distorting your world-model to make it all about your feelings (most of the time, at least).
- Eliezer Yudkowsky 8 Dec 2025 14:25 UTC
  14 points
  0
  Parent
  I would of course have a different response to someone who asked the incredibly different question, “Any learnable tricks for not feeling like crap while the world ends?”
  
  (This could be seen as the theme of a couple of other brief talks at the Solstice. I don’t have a 30-second answer that doesn’t rely on context, and don’t consider myself much of an expert on that question versus the part of the problem constraint that is maintaining epistemic health while you do whatever. That said, being less completely unwilling to spend small or even medium amounts of money made a difference to my life, and so did beginning a romantic relationship in the frame of mind that we might all be dead soon and therefore I ought to do more fun things and worry less about preserving the relationship, which led to a much stronger relationship relative to the wrong things I otherwise do by default.)
  - David Lorell 9 Dec 2025 20:38 UTC
    6 points
    0
    Parent
    (Can you give one or more examples of what doing more fun things in your relationship looks like as opposed to worrying about preserving it?)
bodry 7 Dec 2025 18:08 UTC
3 points
0
This vocalized some thoughts I had about our current culture. Stories can be training for how to act and bad melodramatic tropes are way too common. Every sad song about someone not getting over their ex or a dark hero movie where the protagonist is perpetually depressed about something that happened in the past conditions people the wrong way.

There is an annoying character in the recent Nuremberg film. He’s based off a real person but I don’t know how accurate that portrayal is.

He’s a psychiatrist manipulated by Goering. He’s supposed to prevent the jailed Nazis from killing themselves but he also wants to write a book about the Nazis. In the process he becomes sympathetic to Goering and ferries letters between him and his spouse. When he becomes aware of Goering’s crimes the psychiatrist tells Goering off and slams his cell door. It was ridiculous in the face of the scale of the Holocaust and also because the anger seemed to originate more from the feeling of being lied to. The psychiatrist is portrayed as selfish and has a redemption arc but I don’t think that the writers realized just how selfish that character was.
- Mary Chernyshenko 14 Dec 2025 16:39 UTC
  2 points
  0
  Parent
  They wrote a great reason to get mad at someone. Perfectly observable in nature.
Tapatakt 7 Dec 2025 13:17 UTC
3 points
0
Thank you! Datapoint: I think at least some parts of this can be useful for me personally.
Somehat connected to the first part, one of the most “internal-memetic” moments from “Project: Lawful” for me is this short exchange between Keltham and Maillol:
“For that Matter, what is the Governance budget?”
“Don’t panic. Nobody knows.”
“Why exactly should I not panic?”
“Because it won’t actually help.”
“Very sensible.”
If evil and not very smart bureaucrat understands it, I can too :)
Third part is the most interesting. It makes perfect sense, but I have no easy-to-access perception of this thing. Will try to do something with this skill issue. Also, “internal script / pseudo-predictive sort-of-world-model that instead connects to motor output” looks like the thing that has a 3-syllable max word about it in Baseline. Do you know a good term for it?
However, I feel that all this is much more applicable to the kinds of “going insane” which look like “person does stupid and dramatic things” and less (but nonzero) applicable to other kinds, e.g., anxiety, depression or passive despair at the background (like nonverbalized “meh, it doesn’t really matter what I do, so I can work a little less today”).
- Rana Dexsin 7 Dec 2025 14:02 UTC
  11 points
  0
  Parent
  
  It makes perfect sense, but I have no easy-to-access perception of this thing. Will try to do something with this skill issue.
  
  As someone who believes myself to have had some related experiences, this is very easy to Goodhart on and very easy to screw up badly if you try to go straight for it without [a kind of prepwork that my safety systems say I shouldn’t try to describe] first, and the part where you’re tossing that sentence out without obvious hesitation feels like an immediate bad sign. See also this paragraph from that very section (to be clear, it’s my interpretation that treats it as supporting here, and I don’t directly claim Eliezer would agree with me):
  
  (Frankly I expect almost nobody to correctly identify those words of mine as internally visible mental phenomena after reading them; and I’m worried about what happens if somebody insists on interpreting it anyway. Seriously, if you don’t see phenomena inside you that obviously looks like what I’m describing, it means, you aren’t looking at the stuff I’m talking about. Do not insist on interpreting the words anyway. If you don’t see an elephant, don’t look under every corner of the room until you find something that could maybe be an elephant.)
  
  Please don’t [redacted verb phrase] and passively generate a stack of pseudo-elephants that jam the area and maybe-permanently block off a ton of your improvement potential. The vast majority of human-embodied minds are not meant for that kind of access! I suspect that mine either might have been or almost was, but earlier me still managed to fuck it up in subtle ways, and I had a ton of guardrails and foresight that ~nobody around me seemed to have or even think possible, and didn’t even make the kind of grotesque errors that I imagine the kind of people who write about it the way you just did making.
  
  Please, please just do normal, socially integrated emotional skill building instead if you can get that. This goes double if you haven’t already obviously exhausted what you can get from it (and I’d bet that most people who think of self-modification as cool also have at least a bit of “too cool for school” attitude there, with associated blindspots).
  
  (The “learning to not panic because it won’t actually help” part is fine.)
  - Tapatakt 7 Dec 2025 17:42 UTC
    4 points
    0
    Parent
    Thanks for your concern!
    I think I worded it poorly. I think it is an “internally visible mental phenomena” for me. I do know how it feels and have some access to this thing. It’s different from hyperstition and different from “white doublethink”/”gamification of hyperstition”. It’s easy enough to summon it on command and check, yeah, it’s that thing. It’s the thing that helps to jump in a lake from a 7-meters cliff, that helps to get up from a very comfy bed, that sometimes helps to overcome social anxiety. But I didn’t generalise from these examples to one unified concept before.
    And in the cases where I sometimes do it, my skill issues are due to the fact that the access is not easy enough:
    I can’t do it constantly, it takes several seconds and eats attention.
    I can’t reliably remember to do when it’s most important—in highly stressful situations or when my attention is too occupied with other stuff.
    Some internal processes (usually—strong negative emotions) can override it by uploading more powerful image into the script, so I follow it instead, even while understanding that it’s worse.
    Also it doesn’t really work for long period of time from one uploading. (So it works best when returning to default course of action after initial decision would be hard/impossible/obviously silly/embarassing/weird.)
    Do you think I’m wrong and this is a different thing?
Kaj_Sotala 7 Dec 2025 8:34 UTC
3 points
0
This is why, in a much more real and also famous case, President Truman was validly angered and told “that son of a bitch”, Oppenheimer, to fuck off, after Oppenheimer decided to be a drama queen at Truman.
For anyone else who didn’t remember the details of what this was referencing:
Claude Opus 4.5′s explanation of the reference
This refers to a meeting between J. Robert Oppenheimer and President Harry Truman in October 1945, about two months after the atomic bombings of Hiroshima and Nagasaki.
The meeting itself
Oppenheimer was invited to the Oval Office, ostensibly to discuss the future of atomic energy and weapons policy. At some point during the conversation, Oppenheimer reportedly said to Truman: “Mr. President, I feel I have blood on my hands.”
Truman’s reaction was sharp and dismissive. According to various accounts (primarily from Truman himself and his aides), Truman offered Oppenheimer his handkerchief and said something to the effect of “Would you like to wipe your hands?” After Oppenheimer left, Truman told Dean Acheson (then Undersecretary of State) that he never wanted to see “that son of a bitch” in his office again. Truman reportedly also said, “The blood is on my hands. Let me worry about that.”
Why Truman reacted this way
Truman’s anger seems to have stemmed from a few sources:
1. The decision was Truman’s, not Oppenheimer’s. Oppenheimer built the bomb, but Truman gave the order to use it. From Truman’s perspective, Oppenheimer was claiming moral weight that properly belonged to the person who actually made the decision—and who would have to live with its consequences as a matter of presidential responsibility, not personal drama.
2. Truman viewed it as weakness or self-indulgence. Truman was famously blunt and decisive. He kept a sign on his desk reading “The Buck Stops Here.” A scientist coming to him wringing his hands about guilt may have struck Truman as someone trying to have the significance of the decision without the responsibility for it.
3. The political context. Truman was dealing with the practical aftermath—the emerging Cold War, questions about international control of atomic weapons, the Soviet threat. Someone showing up to perform remorse rather than help solve problems may have seemed unhelpful at best.
The essay’s interpretation
The author seems to be making the point that Oppenheimer’s gesture made the atomic bomb about Oppenheimer—his feelings, his moral status, his inner drama—rather than about the actual event and its consequences. There’s something structurally self-centered about a person involved in a catastrophe centering their own guilt rather than the catastrophe itself. Truman, whatever his flaws, seemed to grasp that the appropriate response to having made such a decision was to own it and deal with its consequences, not to perform anguish about it to the person who actually bore the responsibility.
- Wei Dai 7 Dec 2025 11:55 UTC
  33 points
  1
  Parent
  After reading this article by a human historian (Bill Black), I think there’s a number of inaccuracies in Claude’s account above, but the key point I wanted to verify is that Truman’s reaction happened after just that one sentence by Oppenheimer (which in my mind seems like an appropriate expression of reflection/remorse, not being a drama queen, if he didn’t do or say anything else “dramatic”), and that does seem to be true.
  The author’s conclusions, which seems right to me:
  He, the president, dropped the bomb, not Oppenheimer. How dare this scientist — this government employee — assume the guilt for the greatest weapon ever used in human history? How dare he make himself the hero, albeit a tragic one?
  I think Nolan got this right — this was what really annoyed Truman about Oppenheimer’s comment. By assuming guilt for the bomb, Oppenheimer was taking credit for it. And Truman resented this. He wanted the credit for dropping the bomb and saving American lives, whatever bloodguilt that may have entailed.
  - Eliezer Yudkowsky 7 Dec 2025 17:20 UTC
    12 points
    2
    Parent
    My understanding is that there’s a larger pattern of behavior here by Oppenheimer, which Truman might not’ve known about but which influences my guess about Oppenheimer’s tone that day and the surrounding context. Was Truman particularly famous for wanting sole credit on other occasions?
    - David Joshua Sartor 8 Dec 2025 1:22 UTC
      8 points
      0
      Parent
      It’d be weird for him to take sole credit; he only established full presidential control of nuclear weapons afterward. He didn’t even know about the second bomb until after it dropped.
Christopher King 2 Jan 2026 20:24 UTC
2 points
0
Frankly I expect almost nobody to correctly identify those words of mine as internally visible mental phenomena after reading them
One of the ways you can get up in the morning, if you are me, is by looking in the internal direction of your motor plans, and writing into your pending motor plan the image of you getting out of bed in a few moments, and then letting that image get sent to motor output and happen.
Hmm, isn’t that just psychocybernetics? From Wikipedia:
What’s traditionally called the “subconscious mind” isn’t a “mind” but a cybernetic mechanism built on our nervous system.
- it can accept a goal—image and an emotion determines if it accepts it
- The mechanism has sensing equipment like the eyes and ears
- The various systems, primarily the musculature and nervous systems, propel the mechanism
- The nervous system works with other systems as the correcting device
- The memory can be used to see past successes, making future success more likely
mikbp 19 Dec 2025 12:02 UTC
2 points
0
Just remember that we are just evolved monkeys and the world is very complex. We may very well not have the capacity to see the reason why the world ending because of AI is actually implausible. I have been wrong too many times to get crazy or too upset for a thing I positively know we cannot foresee—even if the possibility space we are able to see is overwhelmingly bad.
Aorou 19 Dec 2025 8:49 UTC
2 points
0
For those wondering about Raistlin Majere, this is from Wikipedia:

« Born to a mother prone to trance-like fits and a woodcutter father, Raistlin inherited his mother’s aptitude for magic. He undertook and passed the arduous Test of High Sorcery, but in the process, he acquired white hair and golden skin and was cursed with hourglass eyes which saw the effects of time on all things. His health, while never robust, was ruined further, leaving him weak and subject to frequent bouts of coughing blood. Initially wearing the white robes of good, as the first series progresses Raistlin’s powers increase while his mood and actions darken, he goes to neutral red robes for the majority of the “War of the Lance” series until he adopts the black robes of evil while under the tutelage of “Fistandantilus” during the War of the Lance.
Raistlin, although physically very weak, is extremely intelligent, and possesses uncommonly powerful magical abilities. While ruthless in his pursuit of power, he holds to a code of conduct which repays all debts and protects those disadvantaged through no fault of their own. His relationship with his much stronger, better-liked, and good-natured twin brother Caramon is fraught with tensions as Caramon seeks to protect and shelter his weaker brother while denying his cruelty and penchant for hurting any others while in pursuit of his goals. »
peter_hartree 14 Dec 2025 9:04 UTC
2 points
2
Thanks for sharing this.

I don’t expect that my methods of sanity will be reproducible by nearly anyone.

I think you’re mistaken here. I’ve long used all three of your methods, broadly speaking, and I know several others for whom that is true.
Nathan Young 13 Dec 2025 11:06 UTC
2 points
0
Somewhat worrying the extent that reading Planecrash really does help understand this.
Does LW have spoilter tags?

edit moderate planecrash spoiler that comes perhaps 70 hours of listening in.
Rough approximation: After a while Keltham, wonders if he’s in a story and begins discussing “the tropes”—he thinks are aspects of his reality which seem to be more story-like and whether they should play into or out of those aspects. Yudkowsky seems to be referencing the same concept here. Do we wish to play towards or away from the tropes we might expect to see?
- habryka 13 Dec 2025 20:08 UTC
  2 points
  0
  Parent
  Yep, just start a paragraph with >! and then you will be in a spoiler tag
NickH 12 Dec 2025 8:03 UTC
2 points
0
I would respond to that question with: “How are you coping with the certainty that you, and everyone you ever knew or cared about or who cares about you, will be dead in a hundred years or so”? (And before many peoples estimate of AI doom). The simple answer is that we did not evolve to be able to truly feel that kind of thing and for good reason.
jmh 9 Dec 2025 2:04 UTC
2 points
0
You really get asked that? Wow.
I also have always found the “the world might end tonight/tomorrow/next week” stories with people running around madly doing all the things they never would have otherwise a bit stretched. But then mob mentalities are not rational so I don’t really try to make too much sense of them
I suppose that would be my first approach to coping with the world ending—just keep my eye open to external madness and perhaps put some space between me and large population or something.
Since I generally don’t believe anyone has ever promised me tomorrow, the end of the world case does seem to fit into the “what has that got to do with me” view. I’d much rather live my life on my own terms than concede I have been living according to other people terms for some reason and feel the end of the world somehow free me from some constraints or something.
Nathan Rosquist 8 Dec 2025 16:32 UTC
2 points
0
Why can’t this too be a trope: having had the thought “I’m a writer and can write myself; I can write internal scripts for what I do and how I react,” the character believes he has near-perfect agency over how he feels, thinks, and acts, until one day a particular stress test (in an accelerating series of increasingly rigorous stress tests) suggests that he doesn’t.
- Davidmanheim 8 Dec 2025 20:26 UTC
  2 points
  0
  Parent
  It’s not a common trope, certainly, but if it is one, it’s also one that Eliezer is happy to play out. (And there are lots of good tropes that people play out which they shouldn’t avoid just because they are tropes—like falling in love, or being a good friend to others when they are sad, or being a conscientious ethical objector, or being someone who can let go of things while having fun, etc.)
Chris Datcu 7 Dec 2025 11:06 UTC
2 points
−1
One could incorrectly summarize all this as “I have decided not to expect to go insane,” but that would violate the epistemic-instrumental firewall and therefore be insane.

would a saner alternative then go in the lines of:
“I have decided to entertain thoughts and actions under the expectation that I will not go insane, because that’s the most adaptive and constructive way to face this situation, even though I can’t be certain”?

if so, I see a good dynamic for sanity;
- choose (non egocentric & constructive) narrative;
- guide thoughts to fit chosen narrative.

slightly tangential question: how do you maintain coherence/continuity of narrative across contexts?
- Eliezer Yudkowsky 7 Dec 2025 17:33 UTC
  22 points
  6
  Parent
  Nope. Breaks the firewall. Exactly as insane.
  
  Beliefs are for being true. Use them for nothing else.
  
  If you need a good thing to happen, use a plan for that.
rain8dome9 20 Dec 2025 6:35 UTC
1 point
0
I think “its someone else’s problem to stop AI” and so sleep soundly.
S 19 Dec 2025 13:19 UTC
1 point
−1
I kind of had a hard time not taking this as an ironic, veiled self-satire narrative by the author using a first-person perspective to deliver between-the-lines the critique of the character they’ve portrayed in the first-person. It hit me at some point that it -could- be, depending on how clever the author was or not. I don’t try to be sharp or ironic as I find it distasteful most of the time, although when I ran into the concept of benevolent irony it gave me moral food for thought, irony has largely just looked like another clever way to wound people, and especially by projecting superior ability against the inferior. In this case it makes for effective satire, however, just because the cleverness (if I’m not misperceiving the author’s intent) is quite brilliant.

That being said, if I were to try and interpret this writing as ironically satirizing the character’s perspective by the author, the identifying tokens would be: to find strength in disconnecting one’s self from enculturation via tropes that allow one to own one’s mistakes so as to make half-hearted fixes after-the-fact which did not require hindsight to avoid causing, maintaining a covert ego in relation to them, and especially over-fixating on the influence of tropes to the point of dehumanization. Am I getting that right? The problem isn’t that the end of the world is stressful, it’s that people’s experience of the end of the world which is annoying. And it’s such an annoying behaviour that the author has learned to transcend by recognizing that, while they may have an Oppenheimer-sized effect upon the end of the world, they at least don’t have his pesky and annoyingly selfish guilt about it.

A ton more observations feed into this interpretation. If I’m misreading the ironic self-satire of the character by the author, writing a first-person narrative towards that effect, and if instead it’s a sincere expression, let me know. I’d proceed just to provide a critical response to the writing without fixating on my perception of the author writing such a clever narrative. And apologize for the assumption.

And then, frankly, I might go on to write that story if it isn’t the case!
- David Joshua Sartor 15 Feb 2026 1:46 UTC
  1 point
  0
  Parent
  This essay is unironic.
Rafael Almeida Reis 19 Dec 2025 11:19 UTC
1 point
0
My favorite book in the past was “A Short History of Nearly Everything” by Bill Bryson. It could also be subtitled “Ten Different Ways the World Could End.” Volcanoes, meteors, plagues, if you think about it, the world is always one step closer to the end. Around that same time, I was super worried about climate change and felt that the end was really near, to the point of avoiding going down to the coast to avoid the risk of being caught in a tsunami. Well, there was no tsunami (at least not here in Brazil), and the world didn’t end (yet).
To deal with this, I created a phrase that is also a life philosophy: “Never bet on the end of the world, because there will be no one to call your bet”. If things go very wrong, nothing I do will make much difference. But if they only seem like they will go wrong and some unpredictable event prevents the end (which I think is quite likely), being well-positioned will yield better returns. And I’m not just talking about financial investments, but long-term goals (career, romance, travel, projects).
Elliot Callender 17 Dec 2025 10:34 UTC
1 point
0
And a fiat decision to stay sane, implemented by not instructing myself that any particular stupidity or failure will be my reaction to future stress.
I have not implemented the other two, but this decision I made during HPPD-like psychosis; yes, it is for some a learnable skill.
Jesper L. 14 Dec 2025 21:47 UTC
1 point
0
I think you are severely underestimating how relatable and common your thoughts on this topic are (also to many journalists). In short, you underestimate people’s capacity to get this (probably because they are out-of-distribution for your way of structured reasoning in general, to borrow LW 2.0 lingo).

If I would make a guess, I think that (self-aware) people outside of LW and similar circles may be even more likely to relate to several of these points than people inside of LW. For example, “a sentence about snow is words, is made of words, but it is about snow” and “if I were to dwell on how it impacted me emotionally that the world was ending, I would be thinking about something which genuinely doesn’t matter to me very much…
…they’re not what I’m about.” is exactly the kind of stance that seems to define large chunks of generation Z right now. In truth, if I did not know who said this, my first guess would be some GenZ celebrity, in a viral video clip on social media.
Chris Wintergreen 10 Dec 2025 18:33 UTC
1 point
0
Eliezer, on number three: I give it a 5% chance that I’m talking about the same thing as you, and that’s before applying my overconfidence factor of 0.6. You’re talking about injecting instructions into your motor plan. I’m visualing doing the thing really hard. It seems to work? It’s like I’m deliberately making a few predictions about the next few seconds, and just continuing to visualise those things rather than thinking about something else, then I just start moving. Is this the same thing you’re talking about? Or am I just doing some form of “Yud said this would work, something, something, placebo effect”? Or is it kinda the same thing in this case because I’m deciding to believe?
This is not something I’ve done previously, I just read this article yesterday and tried it.
chipsmith@scapegoatbooks.com 9 Dec 2025 23:56 UTC
1 point
0
I think the journalistic conceit behind the “how are you coping” question in this context amounts to treacle, and I see value in the frame of eschewing genre. Where I get stuck is that I think the trope/response that the question is intended to elicit would, under the indulged journalistic narrative, play more along the lines of a rational restatement of the Serenity Prayer. In other words, in the script as put, the Eliezer Yudkowsky “character” is being prompted not to give vent to emotive self-concern, but to articulate a more grounded, calm and focused perspective where reasonable hope exists in tension with what might be received or branded as stoic resignation. “How are you coping” is still suspect as a genre prompt, to be sure, just as it is when posed to ordinary people facing any impending or probable tragedy. But I think the implicit narrative expectation and preference, for the journalist who performs their role, is to run with words of ostensible wisdom. I don’t consider this to be a less cynical reading; it merely aligns with my reading of how media narratives are contrived.
Steve Kommrusch 9 Dec 2025 22:55 UTC
1 point
0
Thanks for the interesting peak into your brain. I have a couple thoughts to share on how my own approaches relate.
The first is related to watching plenty of sci-fi apocalyptic future movies. While it’s exciting to see the hero’s adventures, I’d like to think that I’d be one of the scrappy people trying to hold some semblance of civilization together. Or the survivor trying to barter and trade with folks instead of fighting over stuff. In general, even in the face of doom, just trying to help minimize suffering unto the end. So the ‘death with dignity’ ethos fits in with this view.
A second relates to the idea of seeing yourself getting out of bed in the morning. When I’ve had a lot on my plate to the point of seeming stressful, it helps to visualize the future state where I’ve gotten the work done and am looking back. Then just imagining inside my brain sodium ions moving around, electrons dropping energy states, proteins changing shapes, etc, as the problem gets resolved. Visualizing the low-level activity in my brain helps me shift focus from the stress and actually move ahead solving the problem.
XelaP 9 Dec 2025 9:33 UTC
1 point
0
I think I know of the trick you are talking about, in that there does seem to be an obvious pseudoprediction place in my mind that interfaces with motor output, and it’s obviously different from actually believing, or trying to believe. However I mostly can’t manage more than twitches or smaller motor movements, and it gets harder the more resistant I am to doing it (thus, less useful the more I would need use of it). If I’m thinking of the right thing, then the failure of me to sometimes send the pseudoprediction to my muscles seems to be the cause of some various stuff I experience when I essentially can’t get myself to do certain things (e.g. get out of bed) (going by how people react to my more detailed descriptions this phenomena appears to be something very unusual about me).

It feels to me like the same sort of “prediction” as my Inner Sim that visualizes what happens when I throw a ball at the wall—it’s clearly distinct from what “I” believe.

I separately also have experienced the thing where I think the script says I ought to feel X and so I feel X, but that feels totally different to me. Possible exception: I recently (for completely unrelated reasons) had a panic attack (which are very rare for current me), and for a while after the big spike I would get close to having it again partially due to what might have been having that sort of pseudo expectation of the hyperventilating and then accidentally causing it to actually happen, which would then threaten to launch me back into the panic attack. This might secretly be how the script thing works, though it doesn’t feel like it to me.
Jon Garcia 8 Dec 2025 20:19 UTC
1 point
−1
Oh come on, Eliezer. These strategies aren’t that alien.
I remember a time in my early years, feeling apprehensive about entering adolescence and inevitably transforming into a stereotypical rebellious teenager. It would have been not only boring and cliche but also an affront to every good thing I thought about myself. I didn’t want to become a rebellious teenager, and so I decided, before I was overwhelmed with teenage hormones, that I wouldn’t become one. And it turns out that intentional steering of one’s self-narrative can (sometimes) be quite effective (constrained by what’s physically possible, of course)! (Not saying that I couldn’t have done with a bit more epistemological rebellion in my youth.)
The second one comes pretty naturally to me, too. I often feel more like a disembodied observer of the world around me, rather than an active participant. Far more of my mental energy is spent navigating the realm of ideas than identifying with the persona that is everything that everyone else identifies with me, so I tend to think far more about what ought to be done than about how I feel about things. Probably not the best thing for everyone to be like that, though.
There’s also someone I know personally who definitely falls into the third trap, and who is definitely among those for whom this advice would not be helpful at all. She is a genuinely loving, compassionate, and selfless person, but that very selflessness sometimes manifests in a physically debilitating way. Not long after I first got to know her, I noticed that she seemed to exaggerate her reactions to things, not maliciously or even consciously, but more as a sort of moral obligation. As if by not overreacting to every small mishap, it would prove that she didn’t care. As if by not sacrificing her own well-being for the sake of helping everyone around her, it would prove that she didn’t love them. I think at some point in the past, she defined her character as someone who reacts strongly to the things that matter to others, but her subconscious has since twisted this to the point where she now tends to stress herself out over other people’s problems to the point where she becomes physically ill. Again, I don’t think she want to make a martyr out of herself, but I think her self-predicting, motor-directing circuitry thinks that she needs to be one.
An additional possibly-not-helpful bit of advice for the existentially anxious: take a page from Stoicism. Try to imagine all the way things could go disastrously wrong, and try to coax yourself into being emotionally at peace with those outcomes, insofar as they are outside of your control. Strive as much as possible to steer things toward a better future with the tools and resources you have available to you, but practice equanimity towards everything else.
- Regex 12 Jan 2026 10:47 UTC
  1 point
  0
  Parent
  > I often feel more like a disembodied observer of the world around me, rather than an active participant.
  > Far more of my mental energy is spent navigating the realm of ideas than identifying with the persona that is everything that everyone else identifies with me, so I tend to think far more about what ought to be done than about how I feel about things.
  
  This sounds pretty similar to myself, therefore I have some questions:
  
  In the past did you have a lot of overwhelmingly intense emotions? Do you sometimes go from almost non-awareness of feeling (or feeling but weakly) to overwhelming emotion in a very short span?
  Does encountering information also bring with it emotions?
  Do you have alexithymia?
  Are you able to enter your body and basically stay there for 30 minutes without returning to analysis or abstract thought? Does doing this affect your emotions at all?
  
  Is there an internal sense that there is a part of your mind that is ‘personality’+‘analysis/thoughts’, and a part that is ‘where all of the qualia happens’? Possibly also with a third part that is ‘emotions’? These might be divided a bit differently.
  
  For me, the ‘where all of the qualia happens’ component was acting as a blockade between ‘thoughts’ and ‘emotions’ - it enabled thinking to happen even during intense emotions, but seems to have caused alexithymia. And ‘pushing that to the side’ makes intense emotions available to my sense of experiencing things, and thus available for thinking about/analyzing. Instead of being inaccessible.
  
  All of this happened because as a child during tantrums I set up a ‘mental space’ where I could perform logic and analysis even while having extremely intense emotions.
  This seems to be why people say that I am ‘like a robot’ or ‘an absurdly analytical person’ - because I am actively suppressing emotions through that dissociated? state. All the time.
  
  So you might get some benefit from seeing what happens if you go from disembodied to embodied for a while. And put away any tools which keep you in that state.
justinpombrio 8 Dec 2025 15:46 UTC
1 point
0
“There exists a place in your cognition that feels like an expectation but actually stores an action plan that your body will follow, and you can load plans into it.” is a valuable insight and I’m not sure I’ve seen it stated quite in that form elsewhere.

Do you have more you could say about how cognition works, or reliable references to point at?

Everything I’ve read is either true but too specific or low level to be useful (on the science end) or mixed with nonsense (on the meditation end), and my own mind is too muddled to easily distinguish true facts about how it works from almost-true facts about how it works. This makes building up a reliable model really hard.
- Jon Garcia 8 Dec 2025 21:00 UTC
  2 points
  0
  Parent
  If you can get access to the book, try reading The Intelligent Movement Machine. Basically, motor cortex is not so much about stimulating the contraction of certain muscles, but it’s instead encoding the end-configuration to move the body towards (e.g., motor neurons in monkey motor cortex that encode the act of bringing the hand to the mouth, not matter the starting position of the arm). How the muscles actually achieve this is then more a matter of model-based control theory than RL-trained action policy.
  It’s closely related to end-effector control, where the position, orientation, force, speed, etc. of the movement of the end of a robotic appendage are the focus of optimization, as opposed to joint control, which focuses only on the raw motor outputs along the joints of the appendage that cause the movement.
  You can also try diving deeper into the active inference literature if you want to build an intuition for how “predictive” circuits can actually drive motor commands. Just remember that Friston comes at this from the perspective of trying to find unifying mathematical formalisms for everything the brain does, both perception and action, which leads him to use terminology for the action side of things that is unintuitive.
  Active inference is not saying that the brain “predicts” that the body will achieve a certain configuration and then the universe grants its wish. Instead, just like perception is about predicting what things out in the world are causing your senses to receive the signals that they do, action is about predicting what low-level movements of your body would cause your desired high-level behavior and then using those predictions to actually drive the low-level movements. Or rather, the motor cortex is finding the low-level movements (proprioceptive trajectories) that the agent’s intended behavior would cause and then carrying out those movements. Again, don’t get too hung up on the “prediction” nomenclature; the system does what it does regardless of what you call it.
Michael Steele 8 Dec 2025 3:50 UTC
1 point
0
The human brain is just a wacky biological tangle, the same way that human metabolism repurposes the insanely reactive chemical byproduct of superoxide as a key signaling molecule.
It sounds like you read Petro Dobromylsky’s Hyperlipid and Brad Marshall’s Fire in a Bottle!
pku 7 Dec 2025 23:09 UTC
1 point
0
Translating this to the mental script that works for me:
If I picture myself in the role of the astronauts on the Columbia as it was falling apart, or a football team in the last few minutes of a game where they’re twenty points behind, I know the script calls for just keeping up your best effort (as you know it) until after the shuttle explodes or the buzzer sounds. So I can just do that.
Why is there an alternative script that calls to go insane? I think because there’s a version that equates that with a heroic effort, that thinks that if I dramatize and just try harder (as shown by visible effort signalling), that equates with making a true desperate effort that might actually work in a way that just calmly doing my best to the end won’t. But since I know that script is wrong, I can just not play it.
(Why does that script exist? I think for signalling reasons—going insane over something is a good way to shallowly signal I think it’s significant. But it’s not a good way to solve the underlying problem when it’s the underlying problem that needs solving, so I just choose not to do it when that’s the case.
A similar example: If I imagine seeing a news article about a child going missing, it’s easy for me to picture myself remarking “oh that’s terrible, I’m crying just imagining the parents”. If I imagine a child of mine or of a close friend going missing, my mental script’s next step is “okay track down where he was, call the police, think of more action steps”. Because there I care more about finding the child than about signalling that I care about finding the child).
- CronoDAS 8 Dec 2025 9:25 UTC
  3 points
  2
  Parent
  I thought the “going insane” thing would have been about showing everyone around you that you need help and/or are not a person able to give help to anyone else.
  - CronoDAS 10 Dec 2025 20:12 UTC
    2 points
    0
    Parent
    An example: near the end of “Saving Private Ryan”, the squad led by Tom Hanks gets into a pitched battle with some German soldiers. One of the members of the squad spends the entire battle hiding behind a building and crying.
antisocial_spam@proton.me 19 Dec 2025 3:57 UTC
−2 points
0
My method of staying sane is way less complicated.
I am not unique or special. I am human. Ergo things that keep humans sane should work on me.
So I read up on mental health and then did those things. Sleep, nutrition, exercise, sunshine, make friends, community service, clean air.
It’s likely I may still experience issues later in life. But all life is always temporary. It’s about the now, appreciating this moment when I have a dog beside me and a snoring spouse and a wool blanket and a nice book.
I can only control what I can control.
I’m learning rock carving. Rocks are awesome and last through lots of disasters.
[ ]
[deleted]

Eliezer’s Unteachable Methods of Sanity

Stay genre-savvy /​ be an intelligent character.

Don’t make the end of the world be about you.

Just decide to be sane, and write your internal scripts that way.

Stay genre-savvy / be an intelligent character.