What AI safety plans are there?
I’m trying to make a list of every written plan for how to make AGI go well. I am interested in hearing about any plans that I missed, or any lists of plans that have already been compiled.
Here are the plans I’ve found so far, in no particular order:
Google https://deepmind.google/discover/blog/taking-a-responsible-path-to-agi/
OpenAI https://openai.com/index/updating-our-preparedness-framework/
Anthropic https://www.anthropic.com/news/announcing-our-updated-responsible-scaling-policy
https://www.lesswrong.com/posts/8vgi3fBWPFDLBBcAx/planning-for-extreme-ai-risks
https://www.lesswrong.com/posts/bb5Tnjdrptu89rcyY/what-s-the-short-timeline-plan
MIRI https://time.com/6266923/ai-eliezer-yudkowsky-open-letter-not-enough/ or https://intelligence.org/2024/05/29/miri-2024-communications-strategy/ (note: these both touch on MIRI’s plan, but neither is a complete breakdown of a plan)
https://www.lesswrong.com/posts/vZzg8NS7wBtqcwhoJ/nearcast-based-deployment-problem-analysis
https://drive.google.com/file/d/1LbQgcz_aehcnhFMupCtXqoR-BV5dX5Rv/view
How much detail something needs in order to qualify as “a plan” is somewhat arbitrary. I would define a plan as “if we do these things and they all work, then we will be okay.” By this definition, Google is the only company with an actual plan (as far as I know), but I included the closest thing I could find from OpenAI and Anthropic.
I think this is mostly composed of partial technical plans, but maybe you’ll find it useful: https://ai-plans.com/.
Also, I really like these high-level scenarios and plans described by the MIRI technical governance team: https://www.lesswrong.com/posts/WkCfvqyjCzvRrwkaQ/ai-governance-to-avoid-extinction-the-strategic-landscape.
You might find this document I created to be interesting: Proposed Alignment Solutions.
The only other one I saw was from Microsoft. One other to watch would be Amazon (no explicit plan yet), and I guess Meta, but is “open source” a plan?