Wei Dai comments on Wei Dai’s Shortform

Wei Dai 18 Nov 2025 3:57 UTC
19 points
26
Ok I misunderstood your intentions for writing such posts. Given my new understanding, will you eventually move to banning or censoring people for expressing disapproval of what they perceive as bad or unfair moderation, even in their own “spaces”? I think if you don’t, then not enough people will voluntarily leave or self-censor such expressions of disapproval to get the kind of social legitimacy that Eliezer and you desire, but if you do, I think you’ll trigger an even bigger legitimacy problem because there won’t be enough buy-in for such bans/censorship among the LW stakeholders.

If you don’t like it build your own forum that is similarly good or go to a place where someone has built a forum that does whatever you want here.

This is a terrible idea given the economy of scale in such forums.
- habryka 18 Nov 2025 4:09 UTC
  2 points
  1
  Parent
  Given my new understanding, will you eventually move to banning or censoring people for expressing disapproval of what they perceive as bad or unfair moderation, even in their own “spaces”?
  I mean, I had a whole section in the Said post about how I do think it’s a dick move to try to socially censure people for using any moderation tools. If someone keeps trying to create social punishment for people doing that, then yeah, I will ask them to please do that somewhere else but here, or more likely, leave the content up but reduce the degree to which things like the frontpage algorithm feed attention to it. I don’t know how else any norms on the site are supposed to bottom out.
  Top-level posts like this one seem totally fine. Like, if someone wants to be like “I am not trying to force some kind of social punishment on anyone, but I do think there is a relevant consideration here, but I also understand this has been litigated a bunch and I am not planning to currently reopen that”, then that’s fine. Of course you did kind of reopen it, which to be clear I think is fine on the margin, but yeah, I would totally ask you to stop if you did that again and again.
  - Wei Dai 18 Nov 2025 5:01 UTC
    9 points
    2
    Parent
    
    I mean, I had a whole section in the Said post about how I do think it’s a dick move to try to socially censure people for using any moderation tools.
    
    I think an issue you’ll face is that few people will “try to socially censure people for using any moderation tools”, but instead different people will express disapproval of different instances of perceived bad moderation, which adds up to that a large enough share of all author moderation gets disapproved of (or worse blow up into big dramas), such that authors like Eliezer do not feel there’s enough social legitimacy to really use them.
    
    (Like in this case I’m not following the whole site and trying to censure anyone who does author moderation, but speaking up because I myself got banned!)
    
    And Eliezer’s comment hints why this would happen: the comments he wants to delete are often highly upvoted. If you delete such comments, and the mod isn’t a neutral third party, of course a lot of people will feel it was wrong/unfair and want to express disapproval, but they probably won’t be the same people each time.
    
    How are you going to censor or deprioritize such expressions of disapproval? By manual mod intervention? AI automation? Instead of going to that trouble and cause a constant stream of resentment from people feeling wronged and silenced, it seems better for Eliezer to just mark the comments that misinterpret him as misinterpretations (maybe through the react system or a more prominent variation of it, if he doesn’t want to just reply to each one and say “this is a misinterpretation). One idea is reacts from the OP author are distinguished or more prominently displayed somehow.
    - habryka 18 Nov 2025 5:32 UTC
      2 points
      −7
      Parent
      I think an issue you’ll face is that few people will “try to socially censure people for using any moderation tools”,
      No, my guess is this is roughly the issue. I think the vast majority of complaints here tend to be centered in a relatively small group of people who really care.
      It’s not a particularly common expectation that people have about how the internet works, as I have said in other places in this thread. I don’t think the rest of the internet gets these kinds of things right, but I also don’t think that there will be an unquenchable torrent of continuous complaints that will create a landscape of perpetual punishment for anyone trying to use moderation tools.
      I think if you resolve a few disagreements, and moderate a relatively small number of people, you end up at an equlibrium that seems a bunch saner to me.
      - Wei Dai 18 Nov 2025 5:54 UTC
        9 points
        3
        Parent
        The rest of the Internet is also not about rationality though. If Eliezer started deleting a lot of highly upvoted comments questioning/criticizing him (even if based on misinterpretations like Eliezer thinks), I bet there will be plenty of people making posts like “look at how biased Eliezer is being here, trying to hide criticism from others!” These posts themselves will get upvoted quite easily, so this will be a cheap/easy way to get karma/status, as well as (maybe subconsciously) getting back at Eliezer for the perceived injustice.
        
        I don’t know if Eliezer is still following this thread or not, but I’m also curious why he thinks there isn’t enough social legitimacy to exercise his mod powers freely, whether its due to a similar kind of expectation.
        habryka 18 Nov 2025 6:10 UTC
        11 points
        13
        Parent
        I mean, yes, these dynamics have caused many people, including myself, to want to leave LessWrong. It sucks. I wish people stopped. Not all moderation is censorship. The fact that it universally gets treated as such by a certain population of LW commenters is one of the worst aspects of this site (and one of the top reasons why in the absence of my own intervention into reviving the site, this site would likely no longer exist at all today).
        I think we can fix it! I think it unfortunately takes a long time, and continuous management and moderation to slowly build trust that indeed you can moderate things without suddenly everyone going insane. Maybe there are also better technical solutions.
        Claiming this is about “rationality” feels like mostly a weird rhetorical move. I don’t think it’s rational to pretend that unmoderated discussion spaces somehow outperform moderated ones. As has been pointed out many times, 4Chan is not the pinnacle of internet discussion. Indeed, I think largely across the internet, more moderation results in higher trust and higher quality discussions (not universally, you can definitely go on a censorious banning spree as a moderator and try to skew consensus in various crazy ways, but by and large, as a correlation).
        This is indeed an observation so core to LessWrong that Well-Kept Gardens Die By Pacifism was, as far as I can tell, a post necessary for LessWrong to exist at all.
        Wei Dai 18 Nov 2025 6:37 UTC
        17 points
        9
        Parent
        
        Not all moderation is censorship.
        
        I’m not saying this, nor are the hypothetical people in my prediction saying this.
        
        Claiming this is about “rationality” feels like mostly a weird rhetorical move.
        
        We are saying that there is an obvious conflict of interest when an author removes a highly upvoted piece of criticism. Humans being biased when presented with COIs is common sense, so connecting such author moderation with rationality is natural, not a weird rhetorical move.
        
        The rest of your comment seems to be forgetting that I’m only complaining about authors having COI when it comes to moderation, not about all moderation in general. E.g. I have occasional complaints like about banning Said, but generally approve of the job site moderators are doing on LW. Or if you’re not forgetting this, then I’m not getting your point. E.g.
        
        I don’t think it’s rational to pretend that unmoderated discussion spaces somehow outperform moderated ones.
        
        I have no idea how this related to my actual complaint.
        habryka 18 Nov 2025 7:48 UTC
        4 points
        0
        Parent
        We are saying that there is an obvious conflict of interest when an author removes a highly upvoted piece of criticism. Humans being biased when presented with COIs is common sense, so connecting such author moderation with rationality is natural, not a weird rhetorical move.
        Look, we’ve had these conversations.
        I am saying the people who are moderating the spaces have the obvious information advantage about their own preferences and about what it’s actually like to engage with an interlocutor, plus the motivation advantage to actually deal with it. “It’s common sense that the best decisions get made by people with skin in the game and who are most involved with the actual consequences of the relevant decision”. And “it’s common sense that CEOs of organizations make hiring and firing decisions for the people they work with, boards don’t make good firing decisions, the same applies to forums and moderators”.
        This is a discussion as old as time in business and governance and whatever. Framing your position as “common sense” is indeed just a rhetorical move, and I have no problem framing the opposite position in just as much of an “obvious” fashion. Turns out, neither position obviously dominates by common sense! Smart people exist on both sides of this debate. I am not against having it again, and I have my own takes on it, but please don’t try to frame this as some kind of foregone conclusion in which you have the high ground.
        The rest of your comment seems to be forgetting that I’m only complaining about authors having COI when it comes to moderation, not about all moderation in general.
        I was (and largely am) modeling you as being generically opposed to basically any non-spam bans or deletions on the site. Indeed, as I think we’ve discussed, the kind of positions that you express in this thread suggest to me that you should be more opposed to site-wide bans than author bans (since site-wide bans truly make counterveiling perspectives harder to find instead of driving them from the comment sections to top-level posts).
        If you aren’t against site-wide bans, I do think that’s a pretty different situation! I certainly didn’t feel like I was empowered to moderate more in our conversations on moderation over the last year. It seemed to me you wanted both less individual author moderation, and less admin moderation for anything that isn’t spam. Indeed, I am pretty sure, though I can’t find it, that you said that LW moderation really should only establish a very basic level of protection against spam and basic norms of discourse, but shouldn’t do much beyond that, but I might be misremembering.
        If you do support moderation, I would be curious about you DMing me some example of users you think we should ban, or non-spam comments we should delete. My current model of you doesn’t really think those exist.
        Wei Dai 20 Nov 2025 3:05 UTC
        9 points
        2
        Parent
        I think you’re right that I shouldn’t have latched onto the first analogy I thought of. Here’s a list of 11 (for transparency, analogies 3-10 were generated by Gemini 3.0 Pro, though some may have appeared in previous discussions.):
        
        The CEO & The Corporation
        The Judge & The Courtroom
        The Dinner Party Host
        The University Classroom / Professor
        The Conference Breakout Session
        Open Source / GitHub Maintainer
        The Stand-Up Comedian & The Heckler
        The Art Gallery Opening
        Graffiti on a Private House
        The Town Hall vs Private Meetings
        The Hypothetical HOA
        
        I decided to put detailed analysis of these analogies in this collapsed section, as despite extensive changes by me from the original AI-generated text, it doesn’t quite read like my style. Also, it might be too much text and my summary/conclusions below may be sufficient to convey the main points.
        1. The CEO & The Corporation
        
        Analogy: A Forum Post is a “Project.” The Author is the CEO; the Commenter is an Employee. The CEO needs the power to fire employees who disrupt the vision, and the Board (Admins) should defer to the CEO’s judgment.
        Disanalogy: In a corporation, the Board cannot see daily operations, creating information asymmetry; on a forum, Admins see the exact same content as the Author. A CEO has a smaller conflict of interest when firing an employee, because they are judged primarily by the company’s financial performance rather than the perception of their ideas. If they fire an employee who makes a good criticism, they might subsequently look better to others, but the company’s performance will suffer.
        Conclusion: The analogy fails because the Author lacks the financial alignment of a CEO and possesses no special private information that the Admins lack.
        
        2. The Judge & The Courtroom
        
        Analogy: When there is a conflict in the physical world, we find disinterested parties to make enforceable judgments, even if the cost is very high. When the cost is too high, we either bear it (wait forever for a trial date) or give up the possibility of justice or enforcement, rather than allow an interested party to make such judgments.
        Disanalogy: A courtroom has the power of Coercion (forcing the loser to pay, go to jail, or stop doing something). A Forum Author only has the power of Dissociation (refusing to host the commenter’s words). We require neutral judges to deprive people of rights/property; we do not require neutral judges to decide who we associate with.
        Conclusion: Dissociation has its own externalities (e.g., hiding of potentially valuable criticism), which we usually regulate via social pressure, or legitimize via social approval, but you don’t want this and therefore need another source of legitimacy.
        
        3. The Dinner Party Host
        
        Analogy: A Post is a private social gathering. The Author is the Host. The Host can kick out a guest for any reason, such as to curate the conversation to his taste.
        Disanalogy: In the real world, if a Host kicks out a guest that everyone else likes, the other attendees would disapprove and often express such disapproval. There is no mechanism to then suppress such disapproval, like you seek.
        Conclusion: You want the power of the Host without the social accountability that naturally regulates a Host’s behavior.
        
        4. The University Classroom / Professor
        
        Analogy: The Author is a Subject Matter Expert (Professor). The Commenter is a Student. The Dean (Admin) lets the Professor silence students to prevent wasting class time.
        Disanalogy: A classroom has a “scarce microphone” (only one person can speak at a time); a forum has threaded comments (parallel discussions), so the “Student” isn’t stopping the “Professor” from teaching. Additionally, LessWrong participants are often peers, not Student/Teacher.
        Conclusion: The justification for silencing students (scarcity of time/attention, asymmetry of expertise) does not apply to LW.
        
        5. The Conference Breakout Session
        
        Analogy: The Author is like an Organizer who “rented the room” at a convention. The Organizer has the right to eject anyone to accomplish his goals.
        Disanalogy: Just like the Dinner Party, an Organizer would almost never eject someone who is popular with their table. If they did, the table would likely revolt.
        Conclusion: This analogy fails to justify the action of overriding the local consensus (upvotes) of the participants in that sub-thread.
        
        6. Open Source / GitHub Maintainer
        
        Analogy: A Post is a Code Repository. A Comment is a Pull Request. The Maintainer has the absolute right to close a Pull Request as “Wontfix” or “Off Topic” to keep the project focused.
        Disanalogy: In Open Source, a rejected Pull Request is Closed, not Deleted. The history remains visible, easy to find, and auditable. Also, this situation is similar to the CEO in that the maintainer is primarily judged on how well their project works, with the “battle of ideas” aspect a secondary consideration.
        Conclusion: You are asking for more power for an Author than a Maintainer, and a Maintainer has less COI for reasons similar to a CEO.
        
        7. The Stand-Up Comedian & The Heckler
        
        Analogy: The Author is a Comedian. The Commenter is a Heckler. Even if the Heckler is funny (Upvoted), they are stealing the show. The Club (Admins) protects the Comedian because writing a set is high-effort.
        Disanalogy: In a physical club, the Heckler interrupts the show. In a text forum, the comment sits below the post. The audience can consume the Author’s “set” without interference before reading the comment.
        Conclusion: The physical constraints that justify silencing a heckler do not exist in a digital text format.
        
        8. The Art Gallery Opening
        
        Analogy: The Post is a Painting. The Upvoted Comment is a Critic framing the art negatively. The Artist removes the Critic to preserve the intended Context of the work.
        Disanalogy: Art is about aesthetics and subjective experience. LessWrong is ostensibly about intellectual progress and truth-seeking.
        Conclusion: Prioritizing “Context” over “Criticism” serves goals that are not LW’s.
        
        9. Graffiti on a Private House
        
        Analogy: A Post is the Author’s House. A Comment is graffiti. The homeowner has the right to scrub the wall (Delete) so neighbors don’t see it.
        Disanalogy: This is purely about property value and aesthetics.
        Conclusion: Again the goals are too different for the analogy to work.
        
        10. The Town Hall vs Private Meetings
        
        Analogy: In the real world we have both town halls (Neutral Moderator) and meetings in private houses (Author Control). We can have both.
        Disanalogy: Even in the discussions inside a private house, social norms usually prevent a host from kicking out a guest who is making popular points that everyone else agrees with.
        Conclusion: The social legitimacy that you seek doesn’t exist here either.
        
        11. The Hypothetical HOA
        
        Analogy: A hypothetical residential community with HOA rules that say, a homeowner not only has the right to kick out any guests during meetings/parties, but no one is allowed to express disapproval for exercising such powers. Anyone who buys a house in the community is required to sign the HOA agreement.
        Disanalogy: There are already many people in the LW community who never “signed” such agreements.
        Conclusion: You are proposing to ask many (“hundreds”) of the existing “homeowners” (some of whom have invested years of FTE work into site participation) to leave, which is implausible in this hypothetical analogy.
        
        Overall Conclusions
        
        None of the analogies are perfect, but we can see some patterns when considering them together.
        
        Neutral, disinterested judgement is a standard social technology for gaining legitimacy. In the case of courts, it is used to legitimize coercion, an otherwise illegitimate activity that would trigger much opposition. In the case of a forum, it can be used to legitimize (or partly legitimize) removing/hiding/deprioritizing popular/upvoted critiques.
        Some analogies provide a potential new idea for gaining such legitimacy in some cases: relatively strong and short external feedback loops like financial performance (for the CEO) and real-world functionality (for the open source maintainer) can legitimize greater unilateral discretion. This can potentially work on certain types of posts, but most lack such short-term feedback.
        In other cases, suppression of dissent is legitimized for specific reasons clearly not applicable to LW, such as clear asymmetry of expertise between speaker and audience, or physical constraints.
        In the remaining cases, the equivalent of author moderation (e.g., kicking out a houseguest) is legitimized only by social approval, but this is exactly what you and Eliezer want to avoid.
        
        Having gone through all of these possible analogies, I think my intuition for judges/courts being the closest analogy to moderation is correct after all: in both cases, disinterested judgement seems to be the best or only way to gain social legitimacy for unpopular decisions.
        
        However, this exercise also made me realize that in most of the real world we do allow people to unilaterally exercise the power of dissociation, as long as it’s regulated by social approval or disapproval, and this may be a reasonable prior for LW.
        
        Perhaps the strongest argument (for my most preferred policy of no author moderation, period) at this point is that unlike the real world, we lack clear boundaries to signal when we are entering a “private space”, nor is it clear how much power/responsibility the authors are supposed to have, with the site mods also being around. The result is a high cost of background confusion (having to track different people’s moderation policies/styles or failing to do so and being surprised) as well as high probability of drama/distraction whenever it is used, because people disagree or are confused about the relevant norms.
        
        On the potential benefits side, the biggest public benefits of moderation can only appear when it’s against the social consensus, otherwise karma voting would suffice as a kind of moderation. But in this case clearly social approval can’t be a source of legitimacy, and if disinterested judgment and external feedback are also unavailable as sources of legitimacy, then it’s hard to see what can work. (Perhaps worth reemphasizing here, I think this intuitive withholding of legitimacy is correct, due to the high chance of abuse when none of these mechanisms are available.) This leaves the private psychological benefit to the author, which is something I can’t directly discuss (due to not having a psychology that wants to “hard” moderate others), and can only counter with the kind of psychological cost to author-commenters like myself, as described in the OP.
        Wei Dai 18 Nov 2025 7:35 UTC
        4 points
        2
        Parent
        @Ben Pace I’m surprised that you’re surprised. Where did your impression that I generally disapprove of the job site moderators are doing on LW come from, if you can recall?
        Ben Pace 18 Nov 2025 8:57 UTC
        2 points
        0
        Parent
        In the last year I’d guess you’ve written over ten thousand words complaining about LW moderation over dozens of comments, and I don’t recall you ever saying anything positive about the moderation? I recall once said that you won’t leave the site over our actions (so far), which sounds like you’ll bear our moderation, but is quite different from saying it’s overall good.
        Wei Dai 18 Nov 2025 9:26 UTC
        8 points
        2
        Parent
        Thanks, to clarify some more in case it’s helpful, I think I’ve only complained about 2 things, the Said banning and the author moderation policy, and the word count was just from a lot of back and forth, not the number issues I’ve had with the mod team? A lot of what you do is just invisible to me, like the user pre-filtering that habryka mentioned and the routine moderation work, but I assume you’re doing a good job on them, as I’m pretty happy with the general LW environment as far as lack of spam, generally good user behavior, and not seeing many complaints about being unfairly moderated by the mod team, etc.
        
        Found my quote about not leaving:
        
        My response to this is that I don’t trust people to garden their own space, along with other reasons to dislike the ban system. I’m not going to leave LW over it though, but just be annoyed and disappointed at humanity whenever I’m reminded of it.
        
        Yeah I think you misinterpreted it. I was just trying to say that unlike those who got what they wanted (the author mod policy) by leaving or threatening to leave, I’m explicitly not using this threat as a way to get what I want. It was a way to claim the moral high ground I guess. Too bad the message misfired.
        wedrifid 20 Nov 2025 4:25 UTC
        4 points
        2
        Parent
        rsaarelm gave an excellent explanation early on about how the issue seems to be an incompatibility between forum mechanics and blog mechanics, rather than an issue with moderation itself. It would be unfortunate if the point was overlooked because it misunderstood as “moderation is bad”.
        
        It is fair to say that a blog with a policy “I’ll moderate however I like, if you don’t like it leave” works fine. It’s the default and implicit.
        
        When it comes to a forum system with as many potential posters as there are commenters then “If you don’t like it leave” is the implicit ultimatum from every single user to every other. But if the feed system that governs content exposure doesn’t allow leaving individual posters, then the only thing that could be left is the entire forum.
        
        This is why all other significant sites with a many producers → many consumers model all have unsubscribe, mute and/or block features. It helps ensure a few weeds in the Well-Kept Garden don’t drive away all the plants with low toxin tolerance.
        
        It sounds like—particularly from testimony from habryka and Eliezer—moving to a more meta-blog like system is/was critical to lesswrong being viable. Which means leaning in to that structure and fully implementing the requisite features seems like an easy way to improve the experience of everyone.

Wei Dai comments on Wei Dai’s Shortform

1. The CEO & The Corporation

2. The Judge & The Courtroom

3. The Dinner Party Host

4. The University Classroom /​ Professor

5. The Conference Breakout Session

6. Open Source /​ GitHub Maintainer

7. The Stand-Up Comedian & The Heckler

8. The Art Gallery Opening

9. Graffiti on a Private House

10. The Town Hall vs Private Meetings

11. The Hypothetical HOA

Overall Conclusions

4. The University Classroom / Professor

6. Open Source / GitHub Maintainer