Andrew_Critch comments on Andrew_Critch’s Shortform

Andrew_Critch 5 Mar 2026 19:45 UTC
64 points
1
What’s something you believe, that would get negative Karma if earnestly expressed in a normal LessWrong conversation? Write it in quotes. Vote on the meta-claim “would get negative karma” using ✔️/ X, where ✔️ = yes this would get negative karma, and X = no this would get positive or nonnegative karma.
What links here?
- papetoast's comment on papetoast’s Shortforms by papetoast (6 Mar 2026 2:45 UTC; 2 points)
- papetoast’s Shortforms by papetoast (20 Jan 2023 1:56 UTC; 1 point)
- the gears to ascension 5 Mar 2026 23:22 UTC
  49 points
  22
  Parent
  “Eliezer Yudkowsky being deeply irrational in some specific ways, and yet being very popular here, has always been, and continues to be, much of why the community is less effective than it could be at the things he’s interested in. If he wants to become a good influence on the world, he should be more humble and curious, and more willing to brave the gauntlet of posting on this website, rather than hiding in his twitter safe space.”
  
  Intentionally saying the inflammatory version I’d normally soften.
  
  (ninja edit: I also think he’s importantly right on important things; I’m an IABIED-pilled person, at the moment. I just also think he should try to engage with the frontier of research that IABIED-pilled people put out more regularly.)
  What links here?
  - the gears to ascension's comment on Andrew_Critch’s Shortform by Andrew_Critch (5 Mar 2026 23:16 UTC; 2 points)
  - Simon Lermen 6 Mar 2026 11:43 UTC
    8 points
    2
    Parent
    My attempt at understanding the type of reactions Eliezer doesn’t like and make him less excited about posting here on lesswrong:
    In this text, he elaborates why the AI probably won’t just spare us a few resources to keep going:
    https://www.lesswrong.com/posts/F8sfrbPjCQj4KwJqn/the-sun-is-big-but-superintelligences-will-not-spare-earth-a
    The top comment – getting 160 karma compared to 218 for the post itself – attacks him over calling people who use “Comparative advantage means humans will keep jobs” midwits: https://www.lesswrong.com/posts/F8sfrbPjCQj4KwJqn/the-sun-is-big-but-superintelligences-will-not-spare-earth-a?commentId=nzLm7giTn8JPD6bTF
    Now about a year later, look at this example of a GDM employee making a pretty flawed argument based on CA? Would you agree this is well described as “midwit” behavior overapplying maths? https://www.lesswrong.com/posts/tBr4AtpPmwhgfG4Mw/comparative-advantage-and-ai
    Would it have been better to write a diplomatic formal point that would be more likely to convince those people – or is it more important to give people a world model where they understand this type of “not so smart” reasoning is actually common in frontier labs?
    - the gears to ascension 6 Mar 2026 13:12 UTC
      16 points
      12
      Parent
      Well, I do think he could stand to be slightly more diplomatic, not enough to not say people are being foolish, but are you seriously saying that Seb fucking Krier is a midwit? like, he’s being a fool, he’s not thinking carefully, these are actions he’s taking, but “midwit” just sounds like yudkowsky has spent too much time on twitter. This isn’t actually the behavior I want yudkowsky to change most, though his abrasive style definitely has something to do with my objection to how he processes others’ claims; I also think being abrasive when necessary is important and good and one should just say what one thinks unless it’s actually unsafe to do so. But I think being brave enough to be abrasive and then, if and only if you are actually not convinced by objections, just keep being abrasive, might be closer.
      
      My actual complaint is actually not centrally about whether he posts here, I guess. The central example of why I think there’s something wrong is that IABIED seems to use more metaphor than it should. His ontology feels out of date. If he’s right, and I sure do think he is, then I wish he was able to explain why he’s right in terms that are more reliably technically insightful.
      
      Idk, maybe he just doesn’t want to accidentally push capabilities forward. I know people like that. I was much more paranoid in that direction than I am now for a long time. It could be his reason and is the main one I’d think was a reflectively good move rather than a result of human limitations. Or maybe he really only thinks of himself as a communicator now. But I’d still like him to be more able to do ontology-level updates without breaking his understanding. I want to see the yud who groks SLT and stuff like that, and sticks around here even when disagreed with
      - Seth Herd 7 Mar 2026 13:04 UTC
        2 points
        0
        Parent
        It seems like you want him to do more of everything. That’s not a reasonable request. He’s a communicator now, because he decided technical alignment was too hard.
        I disagree with that decision, but primarily because I think he’s doing more harm than good by being so abrasive as the public face of the pause and alignment is hard viewpoints. People should be doing that, just not him.
        He can’t keep up on the technical level without spending lots more time on it because he’s human. And reportedly he has chronic fatigue or despair or something, which would be pretty understandable in his position.
        Having said that, I agree with you that technical alignment is a worthwhile pursuit even if you do think alignment is hard.
        I think we should be recruiting communication specialists to do the public comms part of the project so nerds like the rest of us can shut up and do technical and conceptual work.
        the gears to ascension 7 Mar 2026 14:06 UTC
        16 points
        0
        Parent
        Nah I want him to do slightly less of what he does and slightly more of trying to keep up with research, because I think it would make his communication more able to land for technical people. This is not a fully general request, I think he has a specific blind spot about underrating the value of skimming technical work that isn’t immediately obviously relevant or is in the wrong ontology to immediately weigh on what he’s doing. And generally keeping up with subfields that feel like they should produce relevant insights even if they haven’t. Being able to speak their latest language when telling them why one thinks they’re making a mistake.
        
        It is a somewhat general claim, this is just an example. But like, I’d hope for a specific kind of research flavor curiosity to come from being slightly more humble.
        Seth Herd 7 Mar 2026 16:41 UTC
        2 points
        0
        Parent
        Oh I see. Not only do I agree, but I think this would actually get upvotes.
      - Simon Lermen 6 Mar 2026 21:41 UTC
        2 points
        0
        Parent
        Yeah I probably wouldn’t have included the birds and stones metaphor if it was up to me and would have just explained the idea
  - Seth Herd 7 Mar 2026 12:54 UTC
    2 points
    0
    Parent
    What you originally said probably wouldn’t get down voted. What you clarified to downthread probably would. It’s a much shakier claim than the more general one you lead with.
- dawnlightmelody 6 Mar 2026 21:11 UTC
  41 points
  29
  Parent
  “A rationalist community better at following its ideals would be explicitly antifascist/antiracist/antisexist/etc, and explicitly exclusionary of many fascists/racists/reactionaries/etc it currently tolerates. The community’s current norms around political tolerance and neutrality are more rooted in exclusion trauma, upper-middle-class conflict avoidance norms, and a desire to protect politically valent false beliefs from scrutiny, rather than any aid those norms bring to the community’s rationality.”
  I’ve been intending to write a more careful and less provocative version of this into a post or sequence of posts for a while, so I figured I would post the basic thesis in this somewhat safer thread to get the ball rolling. Apologies for doing the more inflammatory version first; hopefully I’ll find time to write the more careful version sometime in the next few months or so.
  - Lorxus 25 Apr 2026 4:48 UTC
    8 points
    3
    Parent
    I still find myself thinking about this and thinking about how strongly I’ve felt this for as long as I’ve felt this.
  - Viliam 18 Mar 2026 17:03 UTC
    4 points
    1
    Parent
    I disagree, but I would like to read the posts!
- MinusGix 6 Mar 2026 4:35 UTC
  34 points
  7
  Parent
  “Buddhism has been damaging to the epistemics of everyone in this sphere. Buddhism was only ever privileged as a hypothesis due to background SF/Bay-Area spiritualism rather than real merit.
  
  Buddhist materials are explicitly selected for reshaping how you think within their frames. This makes it like joining a minor cult to learn their social skills. Some can extract the useful parts without buying in, but they are notably underrepresented in any discussion (some selection effects of course). The default assumption should be that you won’t, especially as the topic is treated without notable suspicion. Most other religions are massively safer to practice for a few years, though not without their risks, as they have more ritual rather than mental molding, and more argumentation for their Rightness. You’re already primed to notice flaws in arguments. Buddhism operates more directly on your mindset, framing, and probably even values as humans are not idealized agents where those are separate.
  
  Meditation is useful, and probably doesn’t result in a lot of the central and surrounding Buddhist thought. However just like joining a cult, or playing a gacha game, you should be skeptical of Buddhism similarly as they are all Out to Get You.
  
  My less strongly held opinion is that Buddhism’s likely endpoints are incompatible with human values and often truth-seeking. This would matter less if it was treated with suspicion, just as we rightly view most religions with skepticism even while openly discussing them, but it is a gaping hole in our mental defenses.”
  
  (I agree with Ryan Greenblatt that most basically decent posts wouldn’t end up with negative karma for very long though; but I’d expect this to be decently unpopular)
  - the gears to ascension 6 Mar 2026 4:41 UTC
    6 points
    0
    Parent
    I’d like to see the full post carefully argued. Right now, I think I have one specific kind of thing about buddhism I disapprove of (which is that I believe acceptance can be bad actually and not having desires isn’t an inherent virtue) and would back my reason for arguing a similar thing, and otherwise don’t agree and you’d have to convince me.
  - Ben 7 Mar 2026 13:58 UTC
    2 points
    0
    Parent
    I remember a Scot Alexander post a while ago about Bhudism and suffering, I beleive he was asking Isur about some aspect of it. His phrasing implied that the idea that Bhudism has something important to teach us, some kind of magic juice, was to be taken very seriously. I imagine an equivalent post about Hinduism or Islam, or even Kabalistic stuff, would have used more detached ‘they beleive this stuff’ phrasing.
    I dont agree that Bhudism is somehow uniquely unhealthy to people. I do find it interesting how it seems to provoke different instinctive reactions than other religions.
- ryan_greenblatt 5 Mar 2026 23:38 UTC
  21 points
  12
  Parent
  Meta: not that much stuff that is contentful gets negative karma in isolation, only as a response IMO. Like negative karma is way more likely for things that are responding in a way people think is bad/reasonable than things that are just unreasonable statements in isolation.
- TsviBT 5 Mar 2026 22:32 UTC
  18 points
  28
  Parent
  I’m genuinely unsure about the voting, but:
  
  “(In most relevant senses, with substantial translation work and ontological sophistication) God is good and real, (some) religion is both good and true relative to what we have, and many of the classic “new atheist” arguments are bad and the religious counterarguments are largely correct, and LessWrongers (as well as many others) are in the final evaluation being irrational in their allergies to this, and humanity would benefit from investing to make good conceptual progress on this. See https://tsvibt.github.io/theory/index_What_is_God_.html ”
  - Mo Putera 6 Mar 2026 0:25 UTC
    4 points
    0
    Parent
    Any relation to zhukeepa’s views?
    - TsviBT 6 Mar 2026 0:53 UTC
      4 points
      0
      Parent
      The relation is roughly
      
      haha. (Kinda kidding, but also I disagree with whatever of those views I’ve seen, to a great enough extent that I think it would just not be enlightening to compare them as similar.)
- Noosphere89 8 Mar 2026 17:23 UTC
  16 points
  9
  Parent
  AI existential risks, especially extinction risks from a long-termist perspective are now way overfunded compared to better futures work, and longtermism properly interpreted agrees with the common view amongst the general public that sub-existential catastrophes that collapse civilization are at least as important as risks that kill everybody, and are more important to prevent in practice than extinction risks.
  One major upshot of this is that bio-threats, wars that can collapse civilization entirely, or other threats that kill off a large fraction of the population but don’t make them extinct, especially coming from AI is quite a bit more important to prevent than classical AI risk scenarios, and probably deserve more funding than current AI safety.
  Related to this, the maxipok heuristic is a bad guide to action, because expected (and quite likely the actual distribution) distributions of futures are nowhere near as dichotomous as some people think, and because the probability of AGI this century is quite high, it’s quite likely that non-existential interventions persist.
  A better heuristic is to instead focus on a wider portfolio of grand challenges, which were defined in the article as decisions that could affect the value of the future by at least 0.1%, and another better heuristic related to long-term alignment of ASI is to scrap the Coherent Extrapolated Volition target and instead make ASIs execute optimal moral trades.
- Kongo Landwalker 6 Mar 2026 16:49 UTC
  15 points
  21
  Parent
  “Lesswrong community underestimates the risk of nuclear ww3 and overestimates the chance of humanity extinction due to AI”.
- interstice 6 Mar 2026 0:51 UTC
  15 points
  5
  Parent
  “quantum mechanics is probably important to the structure of agency/the mind in some way we don’t understand yet”.
  - AprilSR 6 Mar 2026 7:46 UTC
    5 points
    3
    Parent
    If I had to guess I think it’s relevant to like, anthropic reasoning, or something.
    - interstice 6 Mar 2026 17:28 UTC
      4 points
      0
      Parent
      Yeah it’s more like it’s relevant to the kind of world we find ourselves in. But that is itself important to agency as a given agent design will only be successful in certain kinds of worlds.
  - Mateusz Bagiński 6 Mar 2026 15:54 UTC
    2 points
    0
    Parent
    Quantum mechanics or the [math behind]/[logic underlying] quantum mechanics? I find the latter much more plausible than the former.
    - interstice 6 Mar 2026 17:27 UTC
      2 points
      0
      Parent
      I think QM itself. It’s important somehow that the world is actually quantum mechanical. But probably not in a very direct fashion, but via influencing the sort of high-level properties and entities that end up “emerging” from the base laws.
      - Mateusz Bagiński 7 Mar 2026 13:25 UTC
        2 points
        2
        Parent
        Yeah, ok. I disbelieve this and am interested in hearing legible reasons for why somebody thinks this is likely.
- Noosphere89 7 Mar 2026 19:34 UTC
  14 points
  9
  Parent
  The counting arguments for misalignment, even if they were correct do not show that AI safety is as difficult as some groups like MIRI claim without other very contestable premises that we could attempt to make false.
- programjames 5 Mar 2026 22:59 UTC
  14 points
  17
  Parent
  “People often submit incredibly epistemically rude and short-sighted comments on forums, but they deceive people into upvoting them by putting on a veneer of politeness. ‘John, I feel like you’ve got a nail in your head.’ they say. ‘Your conclusion is wrong so you must not have thought of this thing you explicitly mentioned in your post.’”
- Jonas Hallgren 6 Mar 2026 8:09 UTC
  11 points
  2
  Parent
  Posting things that are adjacent in frame but implies beliefs that are more associated with AI Ethics or normie crowd. E.g let’s say someone does a deep dive into John Rawls A Theory of Justice (fictional example but I’ve seen similar) and doesn’t preface it with relating it to some sort of decision theory or similar it is often assumed that it is not meant for the LW community as it doesn’t make the connections clear enough. I’m not sure this is only a bad thing but sometimes I find that it signals a lack of good faith in accepting other people’s frames?
- Gunnar_Zarncke 6 Mar 2026 20:47 UTC
  10 points
  1
  Parent
  “evidence for the singularity is evidence for theism being true”
- the gears to ascension 5 Mar 2026 23:20 UTC
  8 points
  2
  Parent
  “This post is bad and hard to evaluate, so I asked an AI to do so. Here’s what the AI said: [result]”
  
  (edit: I do in fact think this should be not a downvote-worthy thing to post actually. But I’ve been downvoted every time I tried!)
- faul_sname 5 Mar 2026 22:13 UTC
  7 points
  8
  Parent
  “It is better to have a large number of self-replicating AI agents now which can only operate by taking advantage of the affordances granted by industrial civilization than it would be to prevent any AI self-replicators until such a time as they can spin up an entirely independent chip-fabricaiton-capable industrial stack”.
- quetzal_rainbow 6 Mar 2026 10:04 UTC
  6 points
  1
  Parent
  “75% of karma and engagement received by alignment optimists people is explained by politics (‘it would be bad if AI optimists stopped visiting LW due to low engagement’) and epistemic modesty up to contrarian fetish (‘sure, their arguments sound bad, but what if we are in echo chamber?’), not because their positions and arguments are genuinely good”
- programjames 5 Mar 2026 22:58 UTC
  6 points
  10
  Parent
  “The rationality scene is a little culty.”
  - Karl Krueger 6 Mar 2026 0:56 UTC
    5 points
    0
    Parent
    People have talked about the rationality scene being culty for as long as there’s been a “rationality scene”. There was the awkward period where people used the word “phyg”....
  - Viliam 19 Mar 2026 15:27 UTC
    2 points
    0
    Parent
    The controversial part is “culty” or “a little”?
  - Mo Putera 6 Mar 2026 0:29 UTC
    2 points
    0
    Parent
    In this sense?
  - the gears to ascension 5 Mar 2026 23:16 UTC
    2 points
    0
    Parent
    I think, as with most things, this is mostly a phrasing issue. I’ve said things equivalent to this before, I only think this one would be downvoted because of being a bit low on specificity. The rationality scene does seem a little culty, but I think the structure of where the cultiness is is not as bad as some scenes where cultiness levels are higher but less readily discussed or not considered bad by as many participants. Which is very much not to say things are fine, my usual claim is the rationality community is a vaguely secular religion that has produced actually toxic spinoff cults. Ooh, wait, I do have a real one, inspired by this—eliezer
- Pedro Freire 7 Mar 2026 0:29 UTC
  5 points
  4
  Parent
  I have enough beliefs that would earn negative karma if earnestly expressed in a normal LessWrong conversation to make this website not worth participating in for me.
- Taylor G. Lunt 6 Mar 2026 3:58 UTC
  4 points
  0
  Parent
  “Rationalism is a euphemism for autism (or the “broader autism phenotype”), and LessWrong is an autism club for adults. And the rationalist ideology is essentially a reification of typical autistic preferences.”
  - ChristianKl 6 Mar 2026 9:18 UTC
    4 points
    2
    Parent
    Rationalism is a term that’s used by different people to mean different things.
  - interstice 6 Mar 2026 7:58 UTC
    2 points
    0
    Parent
    [I don’t actually think this is true, but] It would be funny if rationalism turns out to not merely be a euphemism for autism but “mal”functioning oxytocin receptors and rationalists are constitutionally unable to normally feel love/social emotions; whether this would be to the discredit of love or rationalism is up to taste.
  - Richard_Kennaway 6 Mar 2026 19:03 UTC
    −4 points
    −3
    Parent
    Autistic is a dysphemism for sane. Smart, looks at reality, acts effectively.
    
    (Insert Heinlein quote about shining like the Sun vs. a candle.)
    
    ETA: Now I’ve had time to look it up:
    
    “Such men exist, Joe; they are New Man— human in all respects, indistinguishable in appearance or under the scalpel from Homo sap, yet as unlike him in action as the Sun is unlike a single candle.”
    
    ETA2: I’m talking about the present-day use of the word by the general public and the media, not its historical origins or use by psychiatric specialists.
- G Wood 5 Mar 2026 22:16 UTC
  4 points
  −3
  Parent
  One of my particular moral rules is “It is good to intervene in the world to move it towards a state your morality would approve of”
  The intuition pump is:
  ″
  You live next door to a couple. In their moral framework, a husband has the right and duty to discipline his wife physically. She agrees, it’s how she was raised, it’s what she believes is proper. You are fully aware that this is their moral framework and they are aware of your moral framework, you have common knowledge.
  You hear him beating her through the wall.
  Your morality says this is wrong. Theirs says it’s right. Neither of you can appeal to a universal referee or the police, there isn’t one in this hypothetical scenario.
  Do you intervene?
  (This doesn’t have to be fully getting into a punch up, it could be moseying on over there and having a wee chat about it. It could be threatening to withhold favours in future like helping him install a retaining wall or some such, idk. the point is would you act against what they think is right because of what you think is right?)
  ″
  The full argument might be summed up as “I don’t claim cosmic authority, I claim my judgment, I act on it, and reality is the referee” Outcome is everything.
- Haiku 9 Mar 2026 21:13 UTC
  3 points
  2
  Parent
  “A significant portion of LessWrong users (at least 20%) care more about the aesthetics of rationality than they do about humanity. It’s a rationality ouroboros. They use the power of rationality to pursue their values, and their values favor protecting aspects of their niche rationality practice over and above taking action that will prevent my and their loved ones from being killed. This forum is a room full of unarmed scouts getting gunned down by the soldiers working at the AI Labs. The only people on this platform who can credibly claim to care about humanity are the ones who actively oppose those soldiers, as soldiers.”
- Gunnar_Zarncke 6 Mar 2026 20:50 UTC
  3 points
  4
  Parent
  “longevity while individually desirable may not be stable socially”
- testingthewaters 6 Mar 2026 19:02 UTC
  3 points
  −2
  Parent
  “Love is the most powerful force in the world.”
- Taylor G. Lunt 6 Mar 2026 3:53 UTC
  3 points
  0
  Parent
  “There exists information which would drive you (yes you) to madness if you comprehended it.”
  - Jasper Blank 6 Mar 2026 10:48 UTC
    3 points
    0
    Parent
    To me the next question would be is, does there exist True information that would do the same
    - Canaletto 7 Mar 2026 11:40 UTC
      3 points
      1
      Parent
      Likely trivially true, can setup a scene where people recite cognitohazards and then tell you about it. Or something in that neighborhood. Like, “It’s >99.99% likely that this arrangement of atoms exists in the Sun’s plasma: 10100011011000101111010100110100111010110” and you get a psychotic break.
- programjames 5 Mar 2026 22:58 UTC
  3 points
  −6
  Parent
  “Utilitarianism and selfish egoism are mathematically the same [EDIT: i.e. they could be used as synonyms except for their different connotations].”
  - Taylor G. Lunt 6 Mar 2026 3:48 UTC
    1 point
    1
    Parent
    This means the actions that maximize wellbeing for all are always equivalent to the actions that improve my own self-interest? How is this not just straightforwardly false? Any time I act against humanity, I am also acting against my own self-interest? Unless you do some funny definition of self-interest, this cannot be true.
    
    E.g. two buttons: red button sends you to hell for a million years, green button sends everyone else in the universe to hell for a million years. Self-interest, if the term means anything at all, requires you to hit the green button, but utilitarianism obviously demands the opposite.
    - programjames 6 Mar 2026 4:27 UTC
      2 points
      0
      Parent
      Well do you care about the rest of humanity enough to send yourself to hell? Or adopting policies where you only get sent to hell in universes rather than ? Seems like a smart selfish egoist would send themselves to hell.
      - Taylor G. Lunt 6 Mar 2026 5:24 UTC
        2 points
        0
        Parent
        “Well do you care about the rest of humanity enough to send yourself to hell?” Nope. Also, even if I did endorse that decision, it probably still wouldn’t be in my own interest. IMO that decision would be a simple mistake with respect to my self-interest. My empathy is not powerful enough for avoiding some guilt to be worth a million years of torture.
        “Or adopting policies where you only get sent to hell in X universes rather than Y?” In the hypothetical, there is only one universe and two buttons. Any other universes are figments of my imagination. You’re suggesting I imagine a veil of ignorance, and make make moral decisions from behind the veil of ignorance. But assuming a veil of ignorance assumes utilitarianism = egoism, which is what you’re trying to prove. In reality I have one life and I know where I stand in life. I don’t need to make decisions from behind a veil of ignorance. I can steal knowing it makes me richer, without having to wonder whether I’ll end up the thief or the victim. I know I’m the thief, because I’m the one choosing to steal.
        programjames 6 Mar 2026 10:57 UTC
        1 point
        0
        Parent
        So, it seems you endorse a utility function that puts more weight on others than your actual preferences. Wouldn’t you prefer to endorse a different utility function?
        Taylor G. Lunt 7 Mar 2026 5:15 UTC
        1 point
        0
        Parent
        I don’t understand what you mean.
- Canaletto 6 Mar 2026 11:39 UTC
  2 points
  −4
  Parent
  “The usual sleep is death actually. You just get resurrected in the most likely place for you to be resurrected, your waking body 8 hours later.”
  People start complaining that this abuses the word death but then refuse to enter destructive teleports.
- programjames 6 Mar 2026 11:00 UTC
  2 points
  0
  Parent
  “You, not all of you but most of you, should not be working on AI safety.”
- Taylor G. Lunt 6 Mar 2026 4:10 UTC
  2 points
  10
  Parent
  “Prediction markets will be net bad for society.”
- Taylor G. Lunt 6 Mar 2026 4:02 UTC
  2 points
  3
  Parent
  “The intelligence of the smartest AI systems is still somewhere between that of a worm and a squirrel.”
  
  Assuming you could develop a more robust measure of intelligence than IQ and administer the test appropriately to an AI. I’m talking about general intelligence, making all the assumptions you have to make to assume a single factor of intelligence.
- Horosphere 5 Mar 2026 22:45 UTC
  2 points
  −14
  Parent
  Comment withdrawn.
  - TsviBT 6 Mar 2026 2:52 UTC
    2 points
    0
    Parent
    :) https://tsvibt.blogspot.com/2025/11/forum-poweruser-forum.html
  - Taylor G. Lunt 6 Mar 2026 3:49 UTC
    0 points
    0
    Parent
    Most of the content on this website is more interesting and engaging than a bunch of downvote-explanation comments would be.
    - jimmy 6 Mar 2026 19:01 UTC
      3 points
      0
      Parent
      The more interesting argument for that norm is that it makes people accountable for their downvotes and therefore less likely to give dishonestly motivated downvotes.
      
      There’s still no obligation to upvote anything, so if it’s plainly visible that a post is bad and no one cares to explain why it’ll just sit at 0. Downvotes become important when some people (incorrectly) think a post is good, because then it will accrue a positive score if uncorrected. But in that case the downvoter thinks they understand something the upvoters don’t, so maybe they should be explain.
      
      The problem with downvotes without accountability is that if I post something about how people named Taylor are statistically likely to be <bad thing>, it might be true and well supported and important and empathetic… and still you could still just downvote it to censor what you don’t like while most people don’t care enough either way to vote. So we get good posts systematically suppressed in ways that wouldn’t happen if you had to comment “Downvoted because my name is Taylor”, whenever a minority is hostile to a particular truth.
      
      Downvote explanations could be hidden by default so it wouldn’t spill over, but I find myself frequently expanding “comment scored below threshold” to see what it is the community in question really doesn’t want people to think. These comments are rarely boringly bad.
- G Wood 5 Mar 2026 21:56 UTC
  2 points
  2
  Parent
  “Morality is a constructed / evolved coordination technology, and we can evaluate specific implementations by how well they achieve the coordination function.”
  - Horosphere 5 Mar 2026 22:42 UTC
    3 points
    2
    Parent
    Comment withdrawn.
    - the gears to ascension 5 Mar 2026 23:13 UTC
      2 points
      0
      Parent
      Are you saying something different than G Wood? It feels fundamentally similar.
      - Horosphere 5 Mar 2026 23:20 UTC
        3 points
        0
        Parent
        Comment withdrawn.
        the gears to ascension 5 Mar 2026 23:25 UTC
        3 points
        0
        Parent
        a previous comment I’ve made on the topic in which I argue that the evolution statement G Wood said is the referent your moral realism statement most naturally refers to anyway
        Horosphere 5 Mar 2026 23:32 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        the gears to ascension 5 Mar 2026 23:46 UTC
        2 points
        0
        Parent
        Despite my intuitive approach to logical thinking being somewhat explosion-proof, I don’t think I can evaluate the counterlogical “the experiences of all conscious beings were inverted” in a way that is meaningful here; in my intuitive representation it seems to be the case that the variable “positive/negative valence of experience of all conscious beings” is causally efficacious, so inverting it would have the effect of making those beings avoid the negative valences; my primary candidate intuitive sketch for what this variable boils down to is “something information theoretic, possibly literally just any increase in entropy that was trying to be controlled away”. The logical concept I was describing contains all possible minds, and so should depend on the structure of those minds in their origin universes in order to make sense; my claim that it is objective is that I believe you likely can generalize across all minds in all universes with compatible basic properties^[1], and get something that makes sense. I agree with you that there’s likely an underlying basic valence fact, but I think that that valence fact is causally entangled, and I also believe that it only “matters morally” due to the way it affects minds in the “junior rooms” in the “interdimensional council of cosmopolitanisms”.
        (@G Wood see this subthread as answer to your question)
        ^
        (eg, universes without important conservation laws might be too alien for the same moral properties to apply, or something; generally, there might be a class of sufficiently-similar physics and a broader class of too-different physics, where the sufficiently-similar physics produces minds that, if they “visit the interdimensional council of cosmopolitanisms”, they find themselves unable to translate to and from the views of minds in universes with no conservation laws or halting oracles or something.)
        Horosphere 6 Mar 2026 0:01 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        the gears to ascension 6 Mar 2026 0:20 UTC
        2 points
        0
        Parent
        so, like, background: let’s say that the “interdimensional council of cosmopolitanisms” is the space of minds that have cosmopolitan inclinations; I expect this to be a natural group to “flood fill” because imagining one makes you think through what they imagine, which means you get a transitive effect, if you weren’t going to imagine a world you consider to be a hellworld, but a mind you think is in a similar-ish universe to you does think it’s important to imagine the hellworld, then as long as your approach to mapping mindspace is sufficiently efficient, you’ll notice that that mind would consider that hellworld, and think through what goes on in that hellworld. that’s a necessary premise, because otherwise you don’t get enough coverage of mindspace if you start from “minds that have cosmopolitan inclinations and you find natural to imagine”, call that your IDCC entrypoint; that’s already a filter, and it needs to end up being sufficiently inclusive for this idea to work, and then it needs to do a second, transitive filter on the remaining minds, so as to pick a moral coalition that actually covers the space and identifies the moral properties on which there are consensus.
        
        okay, so then there are different sorts of minds in the IDCC. I claim that among those minds are very proto-mind-ish things, like bacteria, or individual neurons, or whatever you think the threshold is; even if your seed minds don’t import them, as long as your IDCC entrypoint includes me, then because my process of “visiting the IDCC” involves thinking through my individual neurons and individual proteins as having mind-ness that aggregates up into my full mind, I end up importing individual neurons and proteins as being things I consider to be intelligent and worthy of having moral behavior with each other, for the reason that doing so seems to be what carries valence for my aggregate mind.
        
        And so when I consider how both the me and the components of me get those negative valence experiences, and I think through the causal path to achieving them, they seem to be fundamentally causally entangled with physics in some way; that is, the negative valence is not merely because, but is made of, the physical state of my neuron being in some way informationally degraded, such that the neuron and the brain it’s in both operate worse until the physical issue is resolved. The “neurons room” of the IDCC, where neurons are considered to be individual minds, has larger minds like myself “enter the room” and ponder the neurons and bacteria and other single cells inside that room, and these larger minds find a structure in the neurons where their negative valences reliably relate to an information theoretic property.
        
        So to invert the valence, my sense is you’d need to invert that property.
        
        But inverting that property seems to break the mind; if the mind isn’t broken, the property is not inverted, because the marginal brokenness is the marginal negative valence, and so inverting the mind so that positive things are negative requires those positive things to be made of noise or something like that; it requires those positive things to be made of brokenness at some relevant scale.
        
        The only way I see to achieve “a mind appears to be me having a good time, but is actually having a bad time” is if you can make a mind which is made of brokenness but is just barely functioning, and that mind is coordinating to become me at a slightly larger scale, without leaking the bad-time-at-the-small-scales into good-time-at-larger-scales. so the mind having a good time is still objectively real, in the same way a wave is objectively real whether it’s carried on water or on a computer running a fluid sim. The wave really does move between the coordinates of the system through the locality of interaction, even if those coordinates are folded up into a ram chip.
        Horosphere 6 Mar 2026 0:47 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        Expand this thread
        the gears to ascension 6 Mar 2026 2:26 UTC
        2 points
        0
        Parent
        I believe that the hard problem of consciousness boils down to “why is there something rather than nothing, from my perspective, right now as I write this or think this?” and that the “okay, but why are things good and bad?” portion is going to turn out to be an unprivileged additional layer imposed by easy-problem-consciousness stuff. I do believe that easy problem stuff is information processing, but I believe it in the sense that there are informational elements—the fundamental building blocks of the universe—and those elements’ informational state is exactly their structural state; and the hard problem of consciousness resides in an unresolveable question of “why should any building block exist locally at all?”. Or in other words, localitypilled something-rather-than-nothing as being the same question as “why does my perspective exist”.
        
        And so I don’t really think the hard problem is terribly relevant. I’m not at all saying it’s easy or doesn’t exist, and I do think people who say that are missing something. But I don’t believe p-zombies can exist in a real universe, because “realness” being missing is the thing that makes something a p-zombie; I think that we are beyond the reach of god, but also have this weird thing where we actually exist. a p-zombie would say the same thing, the math that defines its universe also fully specifies that it would be confused by existing, but since (by definition) it exists in the math sense but not in the actuality sense, it never gets run. in other words, I’m a structural realist who also believes there’s something underneath the structures, but that it’s beyond our reach to know what it is, and that we are doomed to always wonder “why” there is something rather than nothing.
        
        the mind I describe there having a good time overall: I dunno, you could make the host mind pretty huge, and then probably not. It depends on the ratio of how much stuff is happening in the host vs happening in the guest.
        
        The isolation I was talking about is the same kind that happens for virtualization on computers. An example of leaking would be if the external sound driver has buffer underruns and these cause buffer underruns in the guest, for example (not 100% this can happen, but I think so). and similar such things. or even, if the host has faulty ram, the guest will also. those would be leaks. if those aren’t happening, then if the guest is running smoothly as far as it can tell, but the host is actually swapping like mad and the cpu is overwhelmed and ram is full and the hard disk is taking a long time to do anything, then if the guest’s clock is not realtime, it could in principle be unable to tell anything is wrong. that’d be the isolation at hand.
        Horosphere 6 Mar 2026 10:47 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        the gears to ascension 8 Mar 2026 5:28 UTC
        2 points
        0
        Parent
        What I described as host is also known as a simulator (device), and the guest is the simulated thing, simulatee. Simulation does not exit the realm of the physical, it just hides that there are smaller scale simulator/host elements from the simulatee/guest. I don’t see how the host level could be unaware, but the guest could be.
        
        I think I might be a constrained sort of platonist. I don’t think every logical referent we can hypothesize in the language of our logic, which we can say “math-exists”, has to be a real thing which exists outside of our description of it. I do think our universe seems like it ought to be one of many actually real possibilities in a weak remark 4 multiverse, despite that the others can’t be confirmed to exist in a physical sense by us; but I’m not convinced that a full tegmark 4 multiverse is required, where all logically consistent referents exist.
        
        Another way to put it is that logic is in the business of determining what you can say must be true in a given space, starting from some axioms and validity rules; the “really exists” I’m proposing here would be our actual universe’s truth fact. When one uses logic to describe objects which existed prior to writing logic on a page, you’re attempting to preserve the truth of facts; if I have an apple, and I have a banana, then I have an (apple banana). But those names could refer to anything; our universe seems to provide us with actual substrate. A mind structure in another universe which does not exist on any actual substrate physically would have the same confusions I’m expressing and only does not do so due to not being instantiated. This substrate is sometimes called compute or reality fluid, and I’m proposing it also is sometimes called hard-problem-consciousness.
        
        Perhaps my view is vacuous because all logically consistent structures exist and there is nothing which could separate an “underlying substrate” of those structures; then perhaps my view could be technically vacuously correct but really just be “structural realist platonism with tegmark 4″, or so.
        
        But, hence why I think you can describe minds, and describe what they would do if they existed, without knowing if they’re real outside your description. Under this view, describing a mind makes it real/exist/hard-problem-companies to the extent you describe it, by carrying it on the same substrate that carries you. You can never meet a p-zombie, in this view. You can only meet minds which really exist, but which are missing features. That’s where I think current AI fall, for example.
        
        None of what I’ve said in this comment so far directly weighs on your moral valence realism question. I do agree that it’s likely a very small and primitive fact if it’s a general one at all. I haven’t thought as much about it to be able to describe it eloquently, the rest of my comment is reciting views rather than anything new right now. I’ll ponder it.
        [ ]
        [deleted]
        G Wood 6 Mar 2026 0:26 UTC
        −1 points
        0
        Parent
        I like this, well written sir. it feels very similar to my position. I’ve made no claims of convergence like you have but I could certainly see myself agreeing. I need to think on it.
      - G Wood 5 Mar 2026 23:27 UTC
        1 point
        0
        Parent
        Thats funny, I would not consider them similar, what lead you to that feeling? Am I missing an interpretation?
        Mine is a definition of what morality is plus a way of determining the merits of a moral system if you accept my definition.
        Horosphere is making the claim that morality is objective, by which I assume he means that there are things that are universally good vs universally bad in such a way that a mind is uneeded to judge goodness or badness.
    - G Wood 5 Mar 2026 23:09 UTC
      1 point
      0
      Parent
      I’m interested in what you mean by reasonable definitions.
      Also, you’ve basically said “Morality is objective” but with hedging, do you agree? Your position is reasonable and held by many.
      I’m however separating what morality is, from what a particular moral system classifies as good or bad. Distinguishing the classifier from the output. I would say our statements are fundamentally incompatible rather than strictly opposite.
      - [ ]
        [deleted]
  - joseph_c 6 Mar 2026 6:03 UTC
    2 points
    0
    Parent
    I would call that ethics, not morality. I personally distinguish ethics from morality in that ethics is how a society works together despite people wanting conflicting things, while morality is about achieving the most good (however you define “good”). I don’t think this is an official distinction, but I do think it’s useful to distinguish the two concepts.
    - G Wood 6 Mar 2026 19:15 UTC
      1 point
      −1
      Parent
      Hey fair enough, no argument if you find it useful. I dont really see much of a distinction
      Morality usually refers to what i would call a particular moral system, a set of first order normative commitments, what you actually believe is right / wrong or good / bad. It’s the object level rules “killing is bad,” “honesty is good.”
      Ethics is in common parlance the “study” or “science” of figuring out the deeper reason something is right or wrong. Unfortunately in my model that boils down to “large groups of people think it is good or bad, right or wrong”. Hey all of the wrong models have to live somewhere!
      Im still thankful to the philosophers who study ethics, it’s a wonderful thing to try to get to the roots of things and I wouldn’t have been able to understand without reading their work.
- Jemal Young 6 Mar 2026 17:24 UTC
  1 point
  −9
  Parent
  “AI alignment might be doable in the short term but ultimately unsustainable, because humanity might find itself inside an increasingly complex layering of automated research/monitoring/control systems, with each layer interfacing with a more capable layer on one side and a less capable layer on the other, and as this layering accumulates the nougaty center’s awareness of / influence over the outermost layer (the thing that needs to be aligned) will approach zero.”
- G Wood 6 Mar 2026 0:36 UTC
  1 point
  1
  Parent
  “It would be a good idea to invite those with ideas deviant from the lesswrong orthodoxy to out themselves by posting their heretical thoughts in public so we may excise them later” 😈
  For those unable to recognise jokes in text form (a lot of us), this is a joke.
- the gears to ascension 5 Mar 2026 23:19 UTC
  1 point
  5
  Parent
  “This post [sucks/is bad] and you shouldn’t have posted it. Please delete it.”—not reliable, sometimes people agree, but I think people typically downvote when I’ve said things like that.
  
  edit: I do not believe this about all posts, just some. I’d probably phrase it differently in those circumstances, but it is often the case that I think someone is just burning utility by posting something. amused that my comments are at −1, I guess I’m the only one who was reflectively right so far!
- programjames 5 Mar 2026 22:32 UTC
  1 point
  −2
  Parent
  EDIT: Separated into multiple comments.
  - Andrew_Critch 5 Mar 2026 22:53 UTC
    2 points
    0
    Parent
    Can you make these separate comments? Otherwise people can’t vote on them.
  - Horosphere 5 Mar 2026 22:44 UTC
    1 point
    0
    Parent
    Comment withdrawn.
    - programjames 5 Mar 2026 23:02 UTC
      1 point
      0
      Parent
      It should be consistent with any decision theory.
      - Horosphere 5 Mar 2026 23:09 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        programjames 5 Mar 2026 23:18 UTC
        1 point
        0
        Parent
        Not just egoism, selfish egoism. Every utility function people choose is a selfish one or they wouldn’t choose it. The claim isn’t, “selfish egoism is a subset of utilitarianism” but “selfish egoism is identically the same as utilitarianism.”
        yudhister 6 Mar 2026 1:25 UTC
        2 points
        0
        Parent
        This argues that utilitarianism is selfish egoism, but not the contrary? My reading of your position is that someone who had a utility function not dependent on the wellbeing of any other beings would be a selfish egoist, but it’s difficult for me to understand how that could be utilitarian.
        programjames 6 Mar 2026 2:00 UTC
        1 point
        0
        Parent
        How do you determine which beings ought to be in a utilitarian’s utility function? I think it’s generally the utilitarian decides for themselves and the rest of society beats them over the head until the utilitarian includes them too.
        Horosphere 5 Mar 2026 23:24 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        programjames 5 Mar 2026 23:47 UTC
        1 point
        0
        Parent
        I don’t understand what you don’t understand. I heard a remark once about a philosopher who really tried to steelman other people’s arguments, but so that they made sense according to the philosopher, not in the mental frame of the other person. It led to some pretty wacky arguments on the steelman side. I think here, you should assume when I say, “mathematically equivalent,” that’s what I mean. Like, any math you use in utilitarianism is the same as that of selfish egoism. Or, if you tried to put the two philosophies in mathematical terms, you get the exact same equations. So, it extends to logical beings or irrational beings. The words “selfish egoism” and “utilitarianism” are synonyms.
        Horosphere 5 Mar 2026 23:53 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        programjames 5 Mar 2026 23:59 UTC
        1 point
        0
        Parent
        Yes.
        Horosphere 6 Mar 2026 0:09 UTC
        1 point
        0
        Parent
        Comment withdrawn.
        Expand this thread
        programjames 6 Mar 2026 0:40 UTC
        1 point
        0
        Parent
        Perhaps here is where the controversy comes in. The utilitarian comes along and says, “I want to maximize utility!” And everyone thinks, “great! she wants to help everyone out!” The selfish egoist comes along and says, “I am just going to fulfill whatever selfish desires I have!” And everyone thinks, “wow, that’s scary! what stops you from murdering people?”
        
        I think, also, there is a sense in which utilitarians work to maximize the same utility function. This is also true for selfish egoists, but they’re both better and worse at negotiating (they are more prone to negotiate, but utilitarians make mistakes that are biased towards reaching a consensus just because they solve the problem from different directions).
        Horosphere 6 Mar 2026 0:56 UTC
        2 points
        1
        Parent
        Comment withdrawn.
        programjames 6 Mar 2026 1:54 UTC
        2 points
        0
        Parent
        Sorry, I don’t really want to make this a long thing. I have written a little on this elsewhere (1, 2, 3).
  - [ ]
    [deleted]
  - [ ]
    [deleted]
  - [ ]
    [deleted]
- Ryan Meservey 5 Mar 2026 22:05 UTC
  0 points
  2
  Parent
  “This quick take will get few to zero comments because the vast majority of LW-ers believe even their most idiosyncratic beliefs would garner positive karma if earnestly expressed.”
  *Edited to separate my views. Bonus view to follow
  - G Wood 5 Mar 2026 22:21 UTC
    10 points
    0
    Parent
    Separate them out lol, that way I can more clearly disagree with one of your statements while agreeing with the other ;). Well i mean, i disagree with both :D
    - Ryan Meservey 5 Mar 2026 22:24 UTC
      2 points
      0
      Parent
      As one of the commenters to this quick post, I expect you would disagree. XD
      - G Wood 5 Mar 2026 23:02 UTC
        1 point
        0
        Parent
        Made me laugh :D. I do agree with “a belief in positive communal response to earnestness is important for any truth-seeking group”, sadly I don’t believe that standard is achieved even here in the lofty heights of less wrong, where at least we try.
        but “This state of affairs is non-problematic” is an issue, if this post got no comments, that means that less wrong is a total monolith where everyone thinks the same, that’s not good for truth seeking.
  - Eli Tyre 6 Mar 2026 18:27 UTC
    2 points
    0
    Parent
    Seems falsified?
  - Ryan Meservey 5 Mar 2026 23:04 UTC
    2 points
    0
    Parent
    Bonus view: “Assuming it were the case that LW-ers did not comment on this post and expected positive karma from earnestness, this would be non-problematic because 1) a belief in positive communal response to earnestness is important for any truth-seeking group, and 2) individuals often form their beliefs by imagining the responses of their respected peers to those beliefs and roleplaying peer reactions to different propositions is a useful exercise.”
  - Andrew_Critch 5 Mar 2026 22:57 UTC
    2 points
    0
    Parent
    Can you separate these into separate comments, so people can vote separately on them?
- the gears to ascension 5 Mar 2026 23:10 UTC
  −1 points
  1
  Parent
  After ten minutes of thinking, everything I’m thinking of I could respond with, I either have one of two kinds of other reasons to not say publicly right now, or have previously in fact posted and managed to phrase in ways that got many downvotes on gross, but on net were above zero.
  
  edit: found some good ones after 20m of thinking.
- [ ]
  [deleted]