I’d split things this way:
Group A) “Given that stopping the AI race seems nearly impossible, I focus on ensuring humanity builds safe superintelligence”
Group B) “Given that building superintelligence safely under current race dynamics seems nearly impossible, I focus on stopping the AI race”
Group C) “Given deep uncertainty about whether we can align superintelligence under race conditions or stop the race itself, I work to ensure both strategies receive enough resources.”
C is fake; it’s part of A. And A is fake; it’s washing something which we don’t yet understand and should not pretend to understand.
...but it’s not fake; it’s just confused according to your expectations about the future. And yes, some people may say it dishonestly, but we should still be careful not to deny that people can believe things you disagree with, just because those beliefs conflict with your map of the territory.
That said, I don’t see as much value in dichotomizing the groups as others seem to.
What? Surely “it’s fake” is a fine way to say “most people who would say they are in C are not actually working that way and are deceptively presenting as C”? It’s fake.
If you said “mostly bullshit” or “almost always disingenuous” I wouldn’t argue, though I would still question whether it’s actually a majority of people in group C; I’m doubtful of that, but very unsure. Saying it is fake, though, would usually mean it is not a real thing anyone believes, rather than that the view is unusual, confused, or wrong.
Closely related to: You Don’t Exist, Duncan.
I guess we could say “mostly fake”, but there are important senses in which “mostly fake” implies “fake simpliciter”. E.g. a twinkie made of “mostly poison” is just “a poisonous twinkie”. Often people do, and should, summarize things and then make decisions based on the summaries, e.g. “is it poison, or no” --> “can I eat it, or no”. My guess is that the conditions under which it would make sense for you to treat someone as genuinely holding position C, e.g. for purposes of allocating funding to them, are currently met by approximately no one. I could plausibly be wrong about that; I’m not so confident. But that is the assertion I’m trying to make, which is summarized imprecisely as “C is fake”, and I stand by making that assertion in this context. (Analogy: it’s possible for me to be wrong that 2+2=4, but when I say 2+2=4, what I’m asserting / guessing is that 2+2=4 always, everywhere, exactly. https://www.lesswrong.com/s/FrqfoG3LJeCZs96Ym/p/ooypcn7qFzsMcy53R )
Just to clarify:
I am personally uncertain how hard stopping the race is. I have spent some time and money myself trying to promote IABIED, and I have also been trying to do direct alignment research, and when doing so I more often than not think explicitly in scenarios where the AI race does not stop.
Am I in group C? Am I a fake member of C?
I’d say I’d probably endorse C for someone who funds research/activism, and I have basically acted on it myself.
I.e. I’d say it’s reasonable to say “stop the race ASAP”, and in another context say “the race might not be stopped, what projects would still maybe increase odds of survival/success conditional on a race?”
(IDK, and I wouldn’t be the one to judge, and there doesn’t necessarily have to be one to judge.) I guess I’d be a bit more inclined to believe it of you? But it would take more evidence. For example, it would depend on how your stances express themselves specifically in “political” contexts, i.e. in contexts where power is at stake (company governance, internal decision-making by an academic lab about allocating attentional resources, public opinion / discussion, funding decisions, hiring advice). And if you don’t have a voice in such contexts then you don’t count as much of a member of C. (Reminder that I’m talking about “camps”, not sets of individual people with propositional beliefs.)
It seems like you’re narrowing the claim, and I’m no longer sure I disagree with the point, if I’m interpreting it correctly now.
If you’re saying that the group doesn’t act differently in ways that are visible to you, sure. But the definition of the group is one that believes both approaches are viable, and will sometimes support one side and sometimes the other. You could say it doesn’t matter for making individual decisions, because the people are functionally supporting one side or the other at any given time, but that’s different from saying they “are not actually working that way.”
There’s a huge difference between the types of cases, though. A 90% poisonous twinkie is certainly fine to call poisonous[1], but a group that is 90% male isn’t reasonable to call male. You said “most people who would say they are in C are not actually working that way and are deceptively presenting as C”; that seems far more like the latter than the former, because “fake” implies the entire thing is fake[2].
Though so is a 1% poisonous twinkie; perhaps the better example is a meal that is 90% protein, which would be a “protein meal” without implying that no non-protein substance is present.
There is a sense in which this isn’t true: if 5% of an image of a person is modified, I’d agree that the image is fake, but that’s because the claim of fakeness is about the entirety of the image, as a unit. In contrast, if there were 20 people in a composite image, and 12 of them were AI fakes and 8 were actual people, I wouldn’t say the picture is “of fake people”; I’d have to say it’s a mixture of fake and real people. Which seems like the relevant comparison if, as you said in another comment, you are describing “empirical clusters of people”!
The OP is about two “camps” of people. Do you understand what camps are? Hopefully you can see that this indeed does induce the analog of “because the claim of fakeness is about the entirety of the image”. They gain and direct funding, consensus, hiring, propaganda, vibes, parties, organizations, etc., approximately as a unit. Camp A is a 90% poison twinkie. The fact that you are trying to not process this is a problem.
I’m pointing out that the third camp, which you deny really exists, does exist, and as an aside, is materially different in important ways from the other two camps.
You say you don’t think this matters for allocating funding, and you don’t care about what others actually believe. I’m just not sure why either point is relevant here.
Could you name a couple (2 or 3, say) of the biggest representatives of that camp? Biggest in the camp sense, so e.g. high-reputation researchers or high-net-worth funders.
You started by saying that most people who would say they are in C are fake, because they are not actually working that way and are deceptively presenting as C, and that A is also “fake” because it won’t work. So anyone I name in group C, under your view, is just being dishonest. I think that there are many people with good-faith beliefs in both groups, but I don’t understand how naming them helps address the claim you made. (You also said that the view only matters if it’s held by funders, since I guess you claim that only people spending money can have views about how resources should be allocated.)
That said, other than myself, who probably doesn’t count because I’m only in charge of minor amounts of money, a number of people at Open Philanthropy seem to clearly, if implicitly, embrace view C, based on their funding decisions, which include geopolitical efforts to manage risk from AI and potentially lead to agreements, public awareness and education, and technical work on AI safety.
And see this newer post, which also lays out a similar view: https://www.lesswrong.com/posts/7xCxz36Jx3KxqYrd9/plan-1-and-plan-2
Washing? Like safetywashing?
Yeah, safetywashing, or I guess mistake-theory-washing.
(I would like to note that a single person went through and strong downvoted my comments here.)
You make a valid point. Here’s another framing that makes the tradeoff explicit:
Group A) “Alignment research is worth doing even though it might provide cover for racing”
Group B) “The cover problem is too severe. We should focus on race-stopping work instead”
I think we’re just trying to do different things here… I’m trying to describe empirical clusters of people / orgs, you’re trying to describe positions, maybe? And I’m taking your descriptions as pointers to clusters of people, of the form “the cluster of people who say XYZ”. I think my interpretation is appropriate here because there is so much importance-weighted abject insincerity in publicly stated positions regarding AGI X-risk that it just doesn’t make much sense to focus on the stated positions as positions.
Like, the actual people at The Curve or whatever are less “I will do alignment, and will be against racing, and alas, this may provide some cover” and more “I will do fake alignment with no sense that I should be able to present any plausible connection between my work and making safe AGI, and I will directly support racing”. All the people who actually do the stated thing are generally understood to be irrelevant weirdos. The people who say that are being insincere, and in fact support racing.
I was trying to map out disagreements between people who are concerned enough about AI risk.

Agreed that this represents only a fraction of the people who talk about AI risk, and that there are a lot of people who will use some of these arguments as false justifications for their support of racing.

EDIT: as TsviBT pointed out in his comment, the OP is actually about people who self-identify as members of the AI Safety community. Given that, I think that the two splits I mentioned above are still useful models, since most people I end up meeting who self-identify as members of the community seem to be sincere, without stated positions that differ from their actual reasons for doing things. I have met people who I believe to be insincere, but I don’t think they self-identify as part of the AI Safety community. I think that TsviBT’s general point about insincerity in the AI Safety discourse is valid.
Um, no, you responded to the OP with what sure seems like a proposed alternative split. The OP’s split is about

people who self-identify as members of the AI safety community
I think you are making an actual mistake in your thinking, due to a significant gap in your thinking and not just a random thing, and with bad consequences, and I’m trying to draw your attention to it.