I think it would be valuable to ask Anthropic’s policy team (and/or leadership) whether they agree with these statements (or adjacent statements), and whether they have any plans to prioritize these kinds of statements in their communications with policymakers & the public.
It seems to me like a lot of Anthropic employees agree with these statements (or adjacent statements), yet this does not appear to be guiding Anthropic’s official lobbying or policy activities.
I think that the technology being built by companies like Anthropic has a significant (read: double-digit) probability of destroying the entire future of the human species.
What’s more, I think no private company should be in a position to impose this kind of risk on every living human, and I support efforts to make sure that no company ever is.
Further: I do not think that Anthropic or any other actor has an adequate plan for building superintelligence in a manner that brings the risk of catastrophic, civilization-ending misalignment to a level that a prudent and coordinated civilization would accept.
More specifically: I do not believe that the object-level benefits of advanced AI[18] – serious though they may be – currently justify the level of existential risk at stake in any actor, Anthropic included, developing superintelligence given our current understanding of how to do so safely.[19]
But there is, indeed, a clear solution to this problem in principle: namely, to use various methods of capability restraint (coordination, enforcement, etc) to ensure that no one develops superintelligence until we have a radically better understanding of how to do so safely.
I have no idea how Anthropic’s policy team makes decisions. But insofar as they value the input of employees on other teams, it seems plausible to me that Anthropic employees with these beliefs (or adjacent beliefs) could play a meaningful role by speaking out about them, requesting more information about Anthropic’s policy engagements, and having more discussions with Anthropic’s policy/leadership teams about whether and how Anthropic could prioritize these topics more in its policy work & public comms.
Anthropic releases new post about RSI and slowdowns, and also commits to doing verification research and verification-focused policymaker engagement. (In my view, this is probably the best post/announcement about AI risks that has been produced by any frontier AI company in recent memory.)
https://www.anthropic.com/institute/recursive-self-improvement