AI Governance

TagLast edit: 8 Feb 2025 0:32 UTC by lesswrong-internal

AI Governance asks how we can ensure society benefits at large from increasingly powerful AI systems. While solving technical AI alignment is a necessary step towards this goal, it is by no means sufficient.

Governance includes policy, economics, sociology, law, and many other fields.

AI policy ideas: Reading list

Zach Stein-Perlman17 Apr 2023 19:00 UTC

24 points

7 comments4 min readLW link

What an actually pessimistic containment strategy looks like

lc5 Apr 2022 0:19 UTC

681 points

138 comments6 min readLW link 2 reviews

Ways I Expect AI Regulation To Increase Extinction Risk

1a3orn4 Jul 2023 17:32 UTC

234 points

32 comments7 min readLW link

Speaking to Congressional staffers about AI risk

Orpheus16 and hath

4 Dec 2023 23:08 UTC

312 points

25 comments15 min readLW link 1 review

On MAIM and Superintelligence Strategy

Zvi14 Mar 2025 12:30 UTC

53 points

2 comments13 min readLW link

(thezvi.wordpress.com)

Global Call for AI Red Lines—Signed by Nobel Laureates, Former Heads of State, and 200+ Prominent Figures

Charbel-Raphaël22 Sep 2025 18:22 UTC

325 points

27 comments6 min readLW link

What would a compute monitoring plan look like? [Linkpost]

Orpheus1626 Mar 2023 19:33 UTC

158 points

10 comments4 min readLW link

(arxiv.org)

Helen Toner on China, CSET, and AI

Rob Bensinger21 Apr 2019 4:10 UTC

68 points

4 comments7 min readLW link

(rationallyspeakingpodcast.org)

RTFB: On the New Proposed CAIP AI Bill

Zvi10 Apr 2024 18:30 UTC

119 points

14 comments34 min readLW link

(thezvi.wordpress.com)

Compute Thresholds: proposed rules to mitigate risk of a “lab leak” accident during AI training runs

davidad22 Jul 2023 18:09 UTC

80 points

2 comments2 min readLW link

Shift Resources to Advocacy Now (Post 4 of 7 on AI Governance)

Mass_Driver28 May 2025 1:19 UTC

60 points

18 comments32 min readLW link

Reactions to the Executive Order

Zvi1 Nov 2023 20:40 UTC

77 points

4 comments29 min readLW link

(thezvi.wordpress.com)

Should we postpone AGI until we reach safety?

otto.barten18 Nov 2020 15:43 UTC

27 points

36 comments3 min readLW link

Orphaned Policies (Post 5 of 7 on AI Governance)

Mass_Driver29 May 2025 21:42 UTC

70 points

5 comments16 min readLW link

News : Biden-⁠Harris Administration Secures Voluntary Commitments from Leading Artificial Intelligence Companies to Manage the Risks Posed by AI

Jonathan Claybrough21 Jul 2023 18:00 UTC

65 points

10 comments2 min readLW link

(www.whitehouse.gov)

AI labs’ statements on governance

Zach Stein-Perlman4 Jul 2023 16:30 UTC

30 points

0 comments36 min readLW link

We’re Not Ready: thoughts on “pausing” and responsible scaling policies

HoldenKarnofsky27 Oct 2023 15:19 UTC

200 points

33 comments8 min readLW link

Response to Aschenbrenner’s “Situational Awareness”

Rob Bensinger6 Jun 2024 22:57 UTC

197 points

27 comments3 min readLW link

Where are the red lines for AI?

Karl von Wendt5 Aug 2022 9:34 UTC

26 points

10 comments6 min readLW link

If-Then Commitments for AI Risk Reduction [by Holden Karnofsky]

habryka13 Sep 2024 19:38 UTC

28 points

0 comments20 min readLW link

(carnegieendowment.org)

[Question] Where are people thinking and talking about global coordination for AI safety?

Wei Dai22 May 2019 6:24 UTC

112 points

22 comments1 min readLW link

Lessons from the Iraq War for AI policy

Buck10 Jul 2025 18:52 UTC

190 points

25 comments4 min readLW link

President Biden Issues Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence

Tristan Williams30 Oct 2023 11:15 UTC

171 points

39 comments3 min readLW link

(www.whitehouse.gov)

An upcoming US Supreme Court case may impede AI governance efforts

NickGabs16 Jul 2023 23:51 UTC

57 points

17 comments2 min readLW link

The Need for Political Advertising (Post 2 of 7 on AI Governance)

Mass_Driver21 May 2025 0:44 UTC

59 points

2 comments13 min readLW link

The Regulatory Option: A response to near 0% survival odds

Matthew Lowenstein11 Apr 2022 22:00 UTC

46 points

21 comments6 min readLW link

List of requests for an AI slowdown/halt.

Cleo Nardo14 Apr 2023 23:55 UTC

46 points

6 comments1 min readLW link

Soft takeoff can still lead to decisive strategic advantage

Daniel Kokotajlo23 Aug 2019 16:39 UTC

122 points

47 comments8 min readLW link 4 reviews

[Question] Would it be good or bad for the US military to get involved in AI risk?

Grant Demaree1 Jan 2023 19:02 UTC

50 points

12 comments1 min readLW link

The Milton Friedman Model of Policy Change

JohnofCharleston4 Mar 2025 0:38 UTC

143 points

17 comments4 min readLW link

Actionable-guidance and roadmap recommendations for the NIST AI Risk Management Framework

Dan H and Tony Barrett

17 May 2022 15:26 UTC

26 points

0 comments3 min readLW link

Slowing AI: Foundations

Zach Stein-Perlman17 Apr 2023 14:30 UTC

45 points

11 comments17 min readLW link

The Paris AI Anti-Safety Summit

Zvi12 Feb 2025 14:00 UTC

129 points

21 comments21 min readLW link

(thezvi.wordpress.com)

What We Learned from Briefing 70+ Lawmakers on the Threat from AI

leticiagarcia27 May 2025 18:23 UTC

484 points

15 comments16 min readLW link

(substack.com)

AI #35: Responsible Scaling Policies

Zvi26 Oct 2023 13:30 UTC

66 points

10 comments55 min readLW link

(thezvi.wordpress.com)

On the UK Summit

Zvi7 Nov 2023 13:10 UTC

74 points

6 comments30 min readLW link

(thezvi.wordpress.com)

AXRP Episode 28 - Suing Labs for AI Risk with Gabriel Weil

DanielFilan17 Apr 2024 21:42 UTC

12 points

0 comments65 min readLW link

New voluntary commitments (AI Seoul Summit)

Zach Stein-Perlman21 May 2024 11:00 UTC

81 points

17 comments7 min readLW link

(www.gov.uk)

Transformative trustbuilding via advancements in decentralized lie detection

trevor16 Mar 2024 5:56 UTC

20 points

10 comments38 min readLW link

(www.ncbi.nlm.nih.gov)

Finishing The SB-1047 Documentary In 6 Weeks

Michaël Trazzi28 Oct 2024 20:17 UTC

94 points

7 comments4 min readLW link

(manifund.org)

Briefly analyzing the 10-year moratorium amendment

RobertM28 May 2025 3:11 UTC

73 points

1 comment3 min readLW link

All images from the WaitButWhy sequence on AI

trevor8 Apr 2023 7:36 UTC

73 points

5 comments2 min readLW link

Guide to SB 1047

Zvi20 Aug 2024 13:10 UTC

71 points

18 comments53 min readLW link

(thezvi.wordpress.com)

AI Governance & Strategy: Priorities, talent gaps, & opportunities

Orpheus163 Mar 2023 18:09 UTC

56 points

2 comments4 min readLW link

My thoughts on OpenAI’s alignment plan

Orpheus1630 Dec 2022 19:33 UTC

55 points

3 comments20 min readLW link

Good government

rosehadshar10 Sep 2025 13:22 UTC

26 points

0 comments6 min readLW link

OpenAI could help X-risk by wagering itself

VojtaKovarik20 Apr 2023 14:51 UTC

31 points

16 comments1 min readLW link

Applying right-wing frames to AGI (geo)politics

Richard_Ngo8 Jul 2025 18:03 UTC

64 points

25 comments3 min readLW link

(x.com)

AI Moratorium Stripped From BBB

Zvi1 Jul 2025 18:50 UTC

70 points

4 comments6 min readLW link

(thezvi.wordpress.com)

[Linkpost] Biden-Harris Executive Order on AI

beren30 Oct 2023 15:20 UTC

3 points

0 comments1 min readLW link

A path to human autonomy

Nathan Helm-Burger29 Oct 2024 3:02 UTC

53 points

16 comments20 min readLW link

AI #88: Thanks for the Memos

Zvi31 Oct 2024 15:00 UTC

46 points

5 comments77 min readLW link

(thezvi.wordpress.com)

Some cruxes on impactful alternatives to AI policy work

Richard_Ngo10 Oct 2018 13:35 UTC

165 points

13 comments12 min readLW link

How should AI systems behave, and who should decide? [OpenAI blog]

ShardPhoenix17 Feb 2023 1:05 UTC

22 points

2 comments1 min readLW link

(openai.com)

SB 1047: Final Takes and Also AB 3211

Zvi27 Aug 2024 22:10 UTC

92 points

11 comments21 min readLW link

(thezvi.wordpress.com)

The Jackpot Jinx (or why “Superintelligence Strategy” is wrong)

E.G. Blee-Goldman10 Mar 2025 19:18 UTC

13 points

0 comments5 min readLW link

My takes on SB-1047

leogao9 Sep 2024 18:38 UTC

151 points

8 comments4 min readLW link

Little attention seems to be on discouraging hardware progress

RussellThor30 Jun 2023 10:14 UTC

5 points

3 comments1 min readLW link

AXRP Episode 38.1 - Alan Chan on Agent Infrastructure

DanielFilan16 Nov 2024 23:30 UTC

12 points

0 comments14 min readLW link

ChatGPT banned in Italy over privacy concerns

Ollie J31 Mar 2023 17:33 UTC

18 points

4 comments1 min readLW link

(www.bbc.co.uk)

[Question] How promising are legal avenues to restrict AI training data?

thehalliard10 Dec 2022 16:31 UTC

9 points

2 comments1 min readLW link

FLI report: Policymaking in the Pause

Zach Stein-Perlman15 Apr 2023 17:01 UTC

15 points

3 comments1 min readLW link

(futureoflife.org)

[Question] Snapshot of narratives and frames against regulating AI

Jan_Kulveit1 Nov 2023 16:30 UTC

36 points

19 comments3 min readLW link

[Question] What Are Your Preferences Regarding The FLI Letter?

JenniferRM1 Apr 2023 4:52 UTC

−4 points

122 comments16 min readLW link

OpenAI’s Preparedness Framework: Praise & Recommendations

Orpheus162 Jan 2024 16:20 UTC

66 points

1 comment7 min readLW link

Mythbusting the supposed “1,000+ AI state bills that would hobble innovation”

sjadler2 Jul 2025 4:49 UTC

6 points

0 comments1 min readLW link

(stevenadler.substack.com)

Cyborg Periods: There will be multiple AI transitions

Jan_Kulveit and rosehadshar

22 Feb 2023 16:09 UTC

109 points

9 comments6 min readLW link

The AGI Race Between the US and China Doesn’t Exist.

Eva_B3 Jun 2023 0:22 UTC

33 points

15 comments7 min readLW link

(evabehrens.substack.com)

How much to update on recent AI governance moves?

habryka and So8res

16 Nov 2023 23:46 UTC

112 points

5 comments29 min readLW link

AGI will be made of heterogeneous components, Transformer and Selective SSM blocks will be among them

Roman Leventov27 Dec 2023 14:51 UTC

33 points

9 comments4 min readLW link

The AI Adoption Gap: Preparing the US Government for Advanced AI

Lizka2 Apr 2025 23:46 UTC

14 points

2 comments17 min readLW link

(www.forethought.org)

My favorite AI governance research this year so far

Zach Stein-Perlman23 Jul 2023 16:30 UTC

26 points

1 comment7 min readLW link

(blog.aiimpacts.org)

Advice for Activists from the History of Environmentalism

Jeffrey Heninger16 May 2024 18:40 UTC

100 points

10 comments6 min readLW link

(blog.aiimpacts.org)

Talking to Congress: Can constituents contacting their legislator influence policy?

Tristan Williams7 Mar 2024 9:24 UTC

14 points

0 comments19 min readLW link

What does it take to defend the world against out-of-control AGIs?

Steven Byrnes25 Oct 2022 14:47 UTC

212 points

51 comments30 min readLW link 1 review

Many AI governance proposals have a tradeoff between usefulness and feasibility

Orpheus16 and Carson Ezell

3 Feb 2023 18:49 UTC

22 points

2 comments2 min readLW link

National Telecommunications and Information Administration: AI Accountability Policy Request for Comment

sanxiyn11 Apr 2023 22:59 UTC

9 points

0 comments1 min readLW link

(ntia.gov)

Political Funding Expertise (Post 6 of 7 on AI Governance)

Mass_Driver19 Jun 2025 14:14 UTC

59 points

4 comments14 min readLW link

I asked my senator to slow AI

Omid6 Apr 2023 18:18 UTC

21 points

5 comments2 min readLW link

On the Meta and DeepMind Safety Frameworks

Zvi7 Feb 2025 13:10 UTC

45 points

1 comment17 min readLW link

(thezvi.wordpress.com)

China-AI forecasts

NathanBarnard25 Feb 2024 16:49 UTC

40 points

29 comments6 min readLW link

Convergence 2024 Impact Review

David_Kristoffersson24 Mar 2025 20:28 UTC

13 points

0 comments14 min readLW link

[Question] AI safety milestones?

Zach Stein-Perlman23 Jan 2023 21:00 UTC

7 points

5 comments1 min readLW link

Anthropic, Google, Microsoft & OpenAI announce Executive Director of the Frontier Model Forum & over $10 million for a new AI Safety Fund

Zach Stein-Perlman25 Oct 2023 15:20 UTC

31 points

8 comments4 min readLW link

(www.frontiermodelforum.org)

Focus transparency on risk reports, not safety cases

ryan_greenblatt22 Sep 2025 15:27 UTC

47 points

3 comments6 min readLW link

My guess at Conjecture’s vision: triggering a narrative bifurcation

Alexandre Variengien6 Feb 2024 19:10 UTC

75 points

12 comments16 min readLW link

Democratic AI Constitution: Round-Robin Debate and Synthesis

scottviteri24 Jun 2023 19:31 UTC

10 points

4 comments5 min readLW link

(scottviteri.com)

Stopping dangerous AI: Ideal lab behavior

Zach Stein-Perlman9 May 2023 21:00 UTC

8 points

0 comments2 min readLW link

Jack Clark on the realities of AI policy

Kaj_Sotala7 Aug 2022 8:44 UTC

68 points

3 comments3 min readLW link

(threadreaderapp.com)

My Assessment of the Chinese AI Safety Community

Lao Mein25 Apr 2023 4:21 UTC

254 points

95 comments3 min readLW link

AI companies are unlikely to make high-assurance safety cases if timelines are short

ryan_greenblatt23 Jan 2025 18:41 UTC

145 points

5 comments13 min readLW link

Why I Am Skeptical of AI Regulation as an X-Risk Mitigation Strategy

A Ray6 Aug 2022 5:46 UTC

31 points

14 comments2 min readLW link

Dario Amodei leaves OpenAI

Daniel Kokotajlo29 Dec 2020 19:31 UTC

69 points

13 comments1 min readLW link

Schelling points in the AGI policy space

mesaoptimizer26 Jun 2024 13:19 UTC

52 points

2 comments6 min readLW link

[Research log] The board of Alphabet would stop DeepMind to save the world

Lucie Philippon16 Jul 2024 4:59 UTC

6 points

0 comments4 min readLW link

Please Donate to CAIP (Post 1 of 7 on AI Governance)

Mass_Driver7 May 2025 17:13 UTC

119 points

20 comments33 min readLW link

The Dissolution of AI Safety

Roko12 Dec 2024 10:34 UTC

8 points

44 comments1 min readLW link

(www.transhumanaxiology.com)

Should there be just one western AGI project?

rosehadshar and Tom Davidson

3 Dec 2024 10:11 UTC

78 points

75 comments15 min readLW link

(www.forethought.org)

Policy discussions follow strong contextualizing norms

Richard_Ngo1 Apr 2023 23:51 UTC

231 points

61 comments3 min readLW link

[Question] Have any parties in the current European Parliamentary Election made public statements on AI?

MondSemmel10 May 2024 10:22 UTC

9 points

0 comments1 min readLW link

One example of how LLM propaganda attacks can hack the brain

trevor16 Aug 2023 21:41 UTC

27 points

8 comments4 min readLW link

How much might AI legislation cost in the U.S.?

will rinehart28 May 2025 16:21 UTC

−5 points

0 comments11 min readLW link

Invitation to lead a project at AI Safety Camp (Virtual Edition, 2025)

Linda Linsefors, Remmelt Ellen and Robert Kralisch

23 Aug 2024 14:18 UTC

17 points

2 comments4 min readLW link

Learning societal values from law as part of an AGI alignment strategy

John Nay21 Oct 2022 2:03 UTC

5 points

18 comments54 min readLW link

AI pause/governance advocacy might be net-negative, especially without a focus on explaining x-risk

Mikhail Samin27 Aug 2023 23:05 UTC

72 points

9 comments6 min readLW link

Thoughts on responsible scaling policies and regulation

paulfchristiano24 Oct 2023 22:21 UTC

220 points

34 comments6 min readLW link

Mitigating extreme AI risks amid rapid progress [Linkpost]

Orpheus1621 May 2024 19:59 UTC

21 points

7 comments4 min readLW link

Stopping dangerous AI: Ideal US behavior

Zach Stein-Perlman9 May 2023 21:00 UTC

17 points

0 comments3 min readLW link

Alignment can be the ‘clean energy’ of AI

Cameron Berg, Judd Rosenblatt and Trent Hodgeson

22 Feb 2025 0:08 UTC

68 points

8 comments8 min readLW link

The 6D effect: When companies take risks, one email can be very powerful.

scasper4 Nov 2023 20:08 UTC

286 points

42 comments3 min readLW link

The Tech Industry is the Biggest Blocker to Meaningful AI Safety Regulations

garrison16 Aug 2024 19:37 UTC

22 points

1 comment8 min readLW link

(garrisonlovely.substack.com)

Pay Risk Evaluators in Cash, Not Equity

Adam Scholl7 Sep 2024 2:37 UTC

215 points

19 comments1 min readLW link

Q&A on Proposed SB 1047

Zvi2 May 2024 15:10 UTC

74 points

8 comments44 min readLW link

(thezvi.wordpress.com)

Foom Liability

PeterMcCluskey30 Jun 2023 3:55 UTC

22 points

10 comments6 min readLW link

(bayesianinvestor.com)

Anthropic’s Responsible Scaling Policy & Long-Term Benefit Trust

Zac Hatfield-Dodds19 Sep 2023 15:09 UTC

85 points

26 comments3 min readLW link 1 review

(www.anthropic.com)

OpenAI: Fallout

Zvi28 May 2024 13:20 UTC

204 points

25 comments36 min readLW link

(thezvi.wordpress.com)

Understanding the state of frontier AI in China

Mitchell_Porter25 Sep 2025 10:16 UTC

11 points

3 comments3 min readLW link

Report on Frontier Model Training

YafahEdelman30 Aug 2023 20:02 UTC

122 points

21 comments21 min readLW link

(docs.google.com)

The case for removing alignment and ML research from the training dataset

beren30 May 2023 20:54 UTC

50 points

8 comments5 min readLW link

Strategic Considerations Regarding Autistic/Literal AI

Chris_Leong6 Apr 2022 14:57 UTC

−1 points

2 comments2 min readLW link

Contain and verify: The endgame of US-China AI competition

sjadler22 May 2025 8:13 UTC

6 points

7 comments2 min readLW link

(open.substack.com)

Consider Joining the UK Foundation Model Taskforce

Zvi10 Jul 2023 13:50 UTC

105 points

12 comments1 min readLW link

(thezvi.wordpress.com)

Review of Soft Takeoff Can Still Lead to DSA

Daniel Kokotajlo10 Jan 2021 18:10 UTC

85 points

16 comments6 min readLW link

Product safety is a poor model for AI governance

Richard Korzekwa 1 Feb 2023 22:40 UTC

36 points

0 comments5 min readLW link

(aiimpacts.org)

“Shut It Down” is simpler than “Controlled Takeoff”

Raemon24 Sep 2025 17:21 UTC

97 points

29 comments5 min readLW link

Apparently, of the 195 Million the DoD allocated in University Research Funding Awards in 2022, more than half of them concerned AI or compute hardware research

mako yass7 Jul 2023 1:20 UTC

41 points

5 comments2 min readLW link

(www.defense.gov)

America Makes AI Chip Diffusion Deal with UAE and KSA

Zvi19 May 2025 19:10 UTC

51 points

7 comments27 min readLW link

(thezvi.wordpress.com)

Ngo and Yudkowsky on AI capability gains

Eliezer Yudkowsky and Richard_Ngo

18 Nov 2021 22:19 UTC

131 points

61 comments38 min readLW link 1 review

AI governance needs a theory of victory

Corin Katzke and Justin Bullock

21 Jun 2024 16:15 UTC

45 points

8 comments20 min readLW link

(www.convergenceanalysis.org)

Rolling Thresholds for AGI Scaling Regulation

Larks12 Jan 2025 1:30 UTC

40 points

6 comments6 min readLW link

A Narrow Path: a plan to deal with AI extinction risk

Andrea_Miotti, davekasten and Tolga

7 Oct 2024 13:02 UTC

74 points

12 comments2 min readLW link

(www.narrowpath.co)

🇫🇷 Announcing CeSIA: The French Center for AI Safety

Charbel-Raphaël20 Dec 2024 14:17 UTC

101 points

2 comments8 min readLW link

Explaining the Joke: Pausing is The Way

WillPetillo4 Apr 2025 9:04 UTC

24 points

2 comments10 min readLW link

AI companies’ policy advocacy (Sep 2025)

Zach Stein-Perlman29 Sep 2025 15:00 UTC

43 points

0 comments3 min readLW link

Reframing the burden of proof: Companies should prove that models are safe (rather than expecting auditors to prove that models are dangerous)

Orpheus1625 Apr 2023 18:49 UTC

27 points

11 comments3 min readLW link

(childrenoficarus.substack.com)

A summary of current work in AI governance

constructive17 Jun 2023 18:41 UTC

44 points

1 comment11 min readLW link

(forum.effectivealtruism.org)

Training AGI in Secret would be Unsafe and Unethical

Daniel Kokotajlo18 Apr 2025 12:27 UTC

139 points

15 comments6 min readLW link

Safety standards: a framework for AI regulation

joshc1 May 2023 0:56 UTC

19 points

0 comments8 min readLW link

[Question] Should we push for requiring AI training data to be licensed?

ChristianKl19 Oct 2022 17:49 UTC

37 points

32 comments1 min readLW link

Information warfare historically revolved around human conduits

trevor28 Aug 2023 18:54 UTC

37 points

7 comments3 min readLW link

Podcast (+transcript): Nathan Barnard on how US financial regulation can inform AI governance

Aaron Bergman8 Aug 2023 21:46 UTC

8 points

0 comments23 min readLW link

(www.aaronbergman.net)

Retroactive If-Then Commitments

MichaelDickens1 Feb 2025 22:22 UTC

7 points

0 comments1 min readLW link

On the Rationality of Deterring ASI

Dan H5 Mar 2025 16:11 UTC

168 points

34 comments4 min readLW link

(nationalsecurity.ai)

Attaching requirements to model releases has serious downsides (relative to a different deadline for these requirements)

ryan_greenblatt27 Aug 2025 17:04 UTC

99 points

2 comments3 min readLW link

Navigating the Open-Source AI Landscape: Data, Funding, and Safety

André Ferretti and mic

13 Apr 2023 15:29 UTC

32 points

7 comments11 min readLW link

(forum.effectivealtruism.org)

Survey on the acceleration risks of our new RFPs to study LLM capabilities

Ajeya Cotra10 Nov 2023 23:59 UTC

27 points

1 comment8 min readLW link

Rishi to outline his vision for Britain to take the world lead in policing AI threats when he meets Joe Biden

Mati_Roy6 Jun 2023 4:47 UTC

25 points

1 comment1 min readLW link

(www.dailymail.co.uk)

Assessment of intelligence agency functionality is difficult yet important

trevor24 Aug 2023 1:42 UTC

48 points

5 comments9 min readLW link

Analysis of Global AI Governance Strategies

Sammy Martin, Justin Bullock and Corin Katzke

4 Dec 2024 10:45 UTC

49 points

10 comments36 min readLW link

More on Various AI Action Plans

Zvi24 Mar 2025 13:10 UTC

32 points

0 comments11 min readLW link

(thezvi.wordpress.com)

Dario Amodei’s prepared remarks from the UK AI Safety Summit, on Anthropic’s Responsible Scaling Policy

Zac Hatfield-Dodds1 Nov 2023 18:10 UTC

85 points

1 comment4 min readLW link

(www.anthropic.com)

Californians, tell your reps to vote yes on SB 1047!

Holly_Elmore12 Aug 2024 19:50 UTC

46 points

24 comments1 min readLW link

Reflections on the state of the race to superintelligence, February 2025

Mitchell_Porter23 Feb 2025 13:58 UTC

21 points

7 comments4 min readLW link

Current UK government levers on AI development

rosehadshar10 Apr 2023 13:16 UTC

16 points

0 comments4 min readLW link

[Question] What does it look like for AI to significantly improve human coordination, before superintelligence?

Bird Concept15 Jan 2024 19:22 UTC

22 points

2 comments1 min readLW link

Intelsat as a Model for International AGI Governance

rosehadshar and wdmacaskill

13 Mar 2025 12:58 UTC

45 points

0 comments1 min readLW link

(www.forethought.org)

Cruxes on US lead for some domestic AI regulation

Zach Stein-Perlman10 Sep 2023 18:00 UTC

26 points

3 comments2 min readLW link

AI Governance to Avoid Extinction: The Strategic Landscape and Actionable Research Questions

peterbarnett and Aaron_Scher

1 May 2025 22:46 UTC

105 points

7 comments8 min readLW link

(techgov.intelligence.org)

Practical AI risk I: Watching large compute

Gustavo Ramires24 Dec 2022 13:25 UTC

3 points

0 comments1 min readLW link

Under what circumstances have governments cancelled AI-type systems?

David Gross23 Sep 2022 21:11 UTC

7 points

1 comment1 min readLW link

(www.carnegieuktrust.org.uk)

[Linkpost] “Governance of superintelligence” by OpenAI

Daniel_Eth22 May 2023 20:15 UTC

67 points

20 comments2 min readLW link

(openai.com)

My (current) model of what an AI governance researcher does

Johan de Kock26 Aug 2024 17:58 UTC

1 point

2 comments5 min readLW link

What is the ground reality of countries taking steps to recalibrate AI development towards Alignment first?

Nebuch29 Jan 2023 13:26 UTC

8 points

6 comments3 min readLW link

Lab governance reading list

Zach Stein-Perlman25 Oct 2024 18:00 UTC

20 points

3 comments1 min readLW link

Whether governments will control AGI is important and neglected

Seth Herd14 Mar 2025 9:48 UTC

28 points

2 comments9 min readLW link

Analysis: US restricts GPU sales to China

aog7 Oct 2022 18:38 UTC

102 points

58 comments5 min readLW link

Comments on Allan Dafoe on AI Governance

Alex Flint29 Nov 2021 16:16 UTC

13 points

0 comments7 min readLW link

AI Safety is Dropping the Ball on Clown Attacks

trevor22 Oct 2023 20:09 UTC

74 points

83 comments34 min readLW link

Legal Supervision of Frontier AI Labs is the answer.

Gauraventh5 May 2025 13:36 UTC

14 points

2 comments3 min readLW link

(robertandgaurav.substack.com)

Consider not donating under $100 to political candidates

DanielFilan11 May 2025 3:20 UTC

138 points

32 comments1 min readLW link

(danielfilan.com)

US-China trade talks should pave way for AI safety treaty [SCMP crosspost]

otto.barten16 May 2025 16:55 UTC

10 points

0 comments3 min readLW link

“Safety Culture for AI” is important, but isn’t going to be easy

Davidmanheim26 Jun 2023 12:52 UTC

48 points

2 comments2 min readLW link

(forum.effectivealtruism.org)

AI Summer Harvest

Cleo Nardo4 Apr 2023 3:35 UTC

130 points

10 comments1 min readLW link

We are already in a persuasion-transformed world and must take precautions

trevor4 Nov 2023 15:53 UTC

37 points

14 comments6 min readLW link

4 ways to think about democratizing AI [GovAI Linkpost]

Orpheus1613 Feb 2023 18:06 UTC

24 points

4 comments1 min readLW link

(www.governance.ai)

Thoughts on hardware / compute requirements for AGI

Steven Byrnes24 Jan 2023 14:03 UTC

69 points

32 comments24 min readLW link

Eisenhower’s Atoms for Peace Speech

Orpheus1617 May 2023 16:10 UTC

18 points

3 comments11 min readLW link

(www.iaea.org)

FLI And Eliezer Should Reach Consensus

JenniferRM11 Apr 2023 4:07 UTC

21 points

6 comments23 min readLW link

Urging an International AI Treaty: An Open Letter

Olli Järviniemi31 Oct 2023 11:26 UTC

48 points

2 comments1 min readLW link

(aitreaty.org)

Introduction to French AI Policy

Lucie Philippon4 Jul 2024 3:39 UTC

111 points

12 comments6 min readLW link

Shahar Avin On How To Regulate Advanced AI Systems

Michaël Trazzi23 Sep 2022 15:46 UTC

31 points

0 comments4 min readLW link

(theinsideview.ai)

Determining the power of investors over Frontier AI Labs is strategically important to reduce x-risk

Lucie Philippon25 Jul 2024 1:12 UTC

18 points

7 comments2 min readLW link

Request to AGI organizations: Share your views on pausing AI progress

Orpheus16 and simeon_c

11 Apr 2023 17:30 UTC

141 points

11 comments1 min readLW link

Verification methods for international AI agreements

Orpheus1631 Aug 2024 14:58 UTC

14 points

1 comment4 min readLW link

(arxiv.org)

AI #12:The Quest for Sane Regulations

Zvi18 May 2023 13:20 UTC

77 points

12 comments64 min readLW link

(thezvi.wordpress.com)

ARC Evals: Responsible Scaling Policies

Zach Stein-Perlman28 Sep 2023 4:30 UTC

40 points

10 comments2 min readLW link 1 review

(evals.alignment.org)

The Sugar Alignment Problem

Adam Zerner24 Dec 2023 1:35 UTC

5 points

3 comments7 min readLW link

New page: Integrity

Zach Stein-Perlman10 Jul 2024 15:00 UTC

91 points

3 comments1 min readLW link

An ‘AGI Emergency Eject Criteria’ consensus could be really useful.

tcelferact7 Apr 2023 16:21 UTC

5 points

0 comments1 min readLW link

Statement on AI Extinction—Signed by AGI Labs, Top Academics, and Many Other Notable Figures

Dan H30 May 2023 9:05 UTC

382 points

78 comments1 min readLW link 1 review

(www.safe.ai)

What I Would Do If I Were Working On AI Governance

johnswentworth8 Dec 2023 6:43 UTC

112 points

32 comments10 min readLW link

NTIA—AI Accountability Announcement

samshap11 Apr 2023 15:03 UTC

7 points

0 comments1 min readLW link

(www.ntia.doc.gov)

New Bill AB 501 to Prevent OpenAI’s Non-profit Conversion

Peter Windberger25 Mar 2025 0:41 UTC

18 points

1 comment1 min readLW link

Governing High-Impact AI Systems: Understanding Canada’s Proposed AI Bill. April 15, Carleton University, Ottawa

Liav Koren28 Mar 2023 17:48 UTC

11 points

1 comment1 min readLW link

(forum.effectivealtruism.org)

AXRP Episode 26 - AI Governance with Elizabeth Seger

DanielFilan26 Nov 2023 23:00 UTC

14 points

0 comments66 min readLW link

[Linkpost] Scott Alexander reacts to OpenAI’s latest post

Orpheus1611 Mar 2023 22:24 UTC

27 points

0 comments5 min readLW link

(astralcodexten.substack.com)

5 Reasons Why Governments/Militaries Already Want AI for Information Warfare

trevor30 Oct 2023 16:30 UTC

32 points

0 comments10 min readLW link

Mainstream Grantmaking Expertise (Post 7 of 7 on AI Governance)

Mass_Driver23 Jun 2025 1:39 UTC

56 points

7 comments37 min readLW link

Making a conservative case for alignment

Cameron Berg, Judd Rosenblatt, phgubbins and Trent Hodgeson

15 Nov 2024 18:55 UTC

208 points

67 comments7 min readLW link

Excessive AI growth-rate yields little socio-economic benefit.

Cleo Nardo4 Apr 2023 19:13 UTC

27 points

22 comments4 min readLW link

UK AISI: Early lessons from evaluating frontier AI systems

Zach Stein-Perlman25 Oct 2024 19:00 UTC

26 points

0 comments2 min readLW link

(www.aisi.gov.uk)

The Week in AI Governance

Zvi1 Aug 2025 12:20 UTC

18 points

1 comment24 min readLW link

(thezvi.wordpress.com)

Paul Christiano named as US AI Safety Institute Head of AI Safety

Joel Burget16 Apr 2024 16:22 UTC

257 points

58 comments1 min readLW link

(www.commerce.gov)

Re: Anthropic’s suggested SB-1047 amendments

RobertM27 Jul 2024 22:32 UTC

87 points

13 comments9 min readLW link

(www.documentcloud.org)

Advice to junior AI governance researchers

Orpheus168 Jul 2024 19:19 UTC

66 points

1 comment5 min readLW link

Replacement for PONR concept

Daniel Kokotajlo2 Sep 2022 0:09 UTC

59 points

6 comments2 min readLW link

(4 min read) An intuitive explanation of the AI influence situation

trevor13 Jan 2024 17:34 UTC

12 points

26 comments4 min readLW link

GovAI: Towards best practices in AGI safety and governance: A survey of expert opinion

Zach Stein-Perlman15 May 2023 1:42 UTC

28 points

14 comments1 min readLW link

(arxiv.org)

The Defence production act and AI policy

NathanBarnard1 Mar 2024 14:26 UTC

37 points

0 comments2 min readLW link

What is SB 1047 for?

Raemon5 Sep 2024 17:39 UTC

61 points

8 comments3 min readLW link

Pausing AI is Positive Expected Value

Liron10 Mar 2024 17:10 UTC

9 points

2 comments3 min readLW link

(twitter.com)

On the Executive Order

Zvi1 Nov 2023 14:20 UTC

100 points

4 comments30 min readLW link

(thezvi.wordpress.com)

AI Risk Management Framework | NIST

DragonGod26 Jan 2023 15:27 UTC

36 points

4 comments2 min readLW link

(www.nist.gov)

Thoughts on SB-1047

ryan_greenblatt29 May 2024 23:26 UTC

60 points

1 comment11 min readLW link

List #2: Why coordinating to align as humans to not develop AGI is a lot easier than, well… coordinating as humans with AGI coordinating to be aligned with humans

Remmelt24 Dec 2022 9:53 UTC

1 point

0 comments3 min readLW link

Refining MAIM: Identifying Changes Required to Meet Conditions for Deterrence

David Abecassis11 Apr 2025 0:49 UTC

17 points

0 comments11 min readLW link

(intelligence.org)

AXRP Episode 44 - Peter Salib on AI Rights for Human Safety

DanielFilan28 Jun 2025 1:40 UTC

12 points

0 comments103 min readLW link

Notes on Potential Future AI Tax Policy

Zvi25 Apr 2023 13:30 UTC

33 points

6 comments9 min readLW link

(thezvi.wordpress.com)

Responsible Deployment in 20XX

Carson20 Apr 2023 0:24 UTC

4 points

0 comments4 min readLW link

Linkpost: Memorandum on Advancing the United States’ Leadership in Artificial Intelligence

Nisan25 Oct 2024 4:37 UTC

60 points

2 comments1 min readLW link

(www.whitehouse.gov)

An AI Race With China Can Be Better Than Not Racing

niplav2 Jul 2024 17:57 UTC

69 points

36 comments11 min readLW link

Upcoming AI regulations are likely to make for an unsafer world

Shmi3 Jun 2023 1:07 UTC

18 points

14 comments1 min readLW link

GPT-4o System Card

Zach Stein-Perlman8 Aug 2024 20:30 UTC

68 points

11 comments2 min readLW link

(openai.com)

[Question] What prevents SB-1047 from triggering on deep fake porn/voice cloning fraud?

ChristianKl26 Sep 2024 9:17 UTC

27 points

21 comments1 min readLW link

The Sorry State of AI X-Risk Advocacy, and Thoughts on Doing Better

Thane Ruthenis21 Feb 2025 20:15 UTC

152 points

53 comments6 min readLW link

Wentworth and Larsen on buying time

Orpheus16, Thomas Larsen and johnswentworth

9 Jan 2023 21:31 UTC

74 points

6 comments12 min readLW link

[Question] What if we solve AI Safety but no one cares

14285722 Aug 2022 5:38 UTC

18 points

5 comments1 min readLW link

Cyberspace Administration of China: Draft of “Regulation for Generative Artificial Intelligence Services” is open for comments

sanxiyn11 Apr 2023 9:32 UTC

7 points

2 comments1 min readLW link

(archive.is)

Things I Learned Making The SB-1047 Documentary

Michaël Trazzi12 May 2025 17:41 UTC

63 points

2 comments2 min readLW link

The Wizard of Oz Problem: How incentives and narratives can skew our perception of AI developments

Orpheus1620 Mar 2023 20:44 UTC

16 points

3 comments6 min readLW link

Many arguments for AI x-risk are wrong

TurnTrout5 Mar 2024 2:31 UTC

169 points

87 comments12 min readLW link

How is AI governed and regulated, around the world?

Mitchell_Porter30 Mar 2023 15:36 UTC

15 points

6 comments2 min readLW link

Google’s Ethical AI team and AI Safety

magfrump20 Feb 2021 9:42 UTC

12 points

16 comments7 min readLW link

The National Defense Authorization Act Contains AI Provisions

ryan_b5 Jan 2021 15:51 UTC

30 points

24 comments1 min readLW link

[Question] Who is liable for AI?

jmh30 May 2023 13:54 UTC

14 points

4 comments1 min readLW link

Outcomes of the Geopolitical Singularity

Nikola Jurkovic20 May 2025 18:09 UTC

62 points

5 comments5 min readLW link

The Schumer Report on AI (RTFB)

Zvi24 May 2024 15:10 UTC

34 points

3 comments36 min readLW link

(thezvi.wordpress.com)

Massive Scaling Should be Frowned Upon

harsimony17 Nov 2022 8:43 UTC

5 points

6 comments5 min readLW link

Twitter thread on politics of AI safety

Richard_Ngo31 Jul 2024 0:00 UTC

35 points

2 comments1 min readLW link

(x.com)

The case for Doing Something Else (if Alignment is doomed)

Rafael Harth5 Apr 2022 17:52 UTC

94 points

14 comments2 min readLW link

The necessity of “Guardian AI” and two conditions for its achievement

Proica26 May 2024 17:39 UTC

−2 points

0 comments15 min readLW link

A Solution for AGI/ASI Safety

Weibing Wang18 Dec 2024 19:44 UTC

50 points

29 comments1 min readLW link

Superintelligence 22: Emulation modulation and institutional design

KatjaGrace10 Feb 2015 2:06 UTC

13 points

11 comments6 min readLW link

The Cadca Transition Map—Navigating the Path to the ASI Singleton

cadca26 Jun 2025 18:30 UTC

1 point

0 comments10 min readLW link

Analysing a 2036 Takeover Scenario

ukc100146 Oct 2022 20:48 UTC

9 points

2 comments27 min readLW link

Truthful AI: Developing and governing AI that does not lie

Owain_Evans, owencb and Lukas Finnveden

18 Oct 2021 18:37 UTC

82 points

9 comments10 min readLW link

Anthropic: Core Views on AI Safety: When, Why, What, and How

jonmenaster9 Mar 2023 17:34 UTC

17 points

1 comment22 min readLW link

(www.anthropic.com)

Controlling AGI Risk

TeaSea15 Mar 2024 4:56 UTC

6 points

8 comments4 min readLW link

[Question] Resources on quantifiably forecasting future progress or reviewing past progress in AI safety?

C.S.W.13 Sep 2025 23:24 UTC

2 points

1 comment1 min readLW link

President of European Commission expects human-level AI by 2026

sanyer20 May 2025 14:13 UTC

35 points

4 comments1 min readLW link

(ec.europa.eu)

2024 State of the AI Regulatory Landscape

Deric Cheng and Elliot Mckernon

28 May 2024 11:59 UTC

30 points

0 comments2 min readLW link

(www.convergenceanalysis.org)

AI as a Cognitive Decoder: Rethinking Intelligence Evolution

Hu Xunyi13 Feb 2025 15:51 UTC

1 point

0 comments1 min readLW link

Trajectories to 2036

ukc1001420 Oct 2022 20:23 UTC

3 points

1 comment14 min readLW link

The AI alignment problem in socio-technical systems from a computational perspective: A Top-Down-Top view and outlook

zhaoweizhang15 Jul 2024 18:56 UTC

3 points

0 comments9 min readLW link

Announcing Open Philanthropy’s AI governance and policy RFP

Julian Hazell17 Jul 2024 2:02 UTC

25 points

0 comments1 min readLW link

(www.openphilanthropy.org)

Ninety-five theses on AI

hamandcheese16 May 2024 17:51 UTC

21 points

0 comments7 min readLW link

Applying superintelligence without collusion

Eric Drexler8 Nov 2022 18:08 UTC

109 points

63 comments4 min readLW link

Help us seed AI Safety Brussels

gergogaspar and ENAIS

7 Aug 2024 6:32 UTC

3 points

2 comments3 min readLW link

[AN #61] AI policy and governance, from two people in the field

Rohin Shah5 Aug 2019 17:00 UTC

12 points

2 comments9 min readLW link

(mailchi.mp)

What are Responsible Scaling Policies (RSPs)?

Vishakha and Algon

5 Apr 2025 16:01 UTC

3 points

0 comments1 min readLW link

(aisafety.info)

The International PauseAI Protest: Activism under uncertainty

Joseph Miller12 Oct 2023 17:36 UTC

32 points

1 comment4 min readLW link

Trading off compute in training and inference (Overview)

Pablo Villalobos31 Jul 2023 16:03 UTC

42 points

2 comments7 min readLW link

(epochai.org)

Quick Thoughts on A.I. Governance

Nicholas Kross30 Apr 2022 14:49 UTC

70 points

8 comments2 min readLW link

(www.thinkingmuchbetter.com)

OHGOOD: A coordination body for compute governance

Adam Jones4 May 2024 12:03 UTC

5 points

2 comments16 min readLW link

(adamjones.me)

The Governance Problem and the “Pretty Good” X-Risk

Zach Stein-Perlman29 Aug 2021 18:00 UTC

5 points

2 comments11 min readLW link

PCAST Working Group on Generative AI Invites Public Input

Christopher King13 May 2023 22:49 UTC

7 points

0 comments1 min readLW link

(terrytao.wordpress.com)

Proposing Human Survival Strategy based on the NAIA Vision: Toward the Co-evolution of Diverse Intelligences

Hiroshi Yamakawa27 Feb 2025 5:18 UTC

−2 points

0 comments11 min readLW link

Case Story: Lack of Consumer Protection Procedures AI Manipulation and the Threat of Fund Concentration in Crypto Seeking Assistance to Fund a Civil Case to Establish Facts and Protect Vulnerable Consumers from Damage Caused by Automated Systems

Petr 'Margot' Andreev8 Aug 2024 5:55 UTC

−9 points

0 comments9 min readLW link

London Working Group for Short/Medium Term AI Risks

scronkfinkle8 Apr 2025 17:32 UTC

5 points

0 comments2 min readLW link

Three pillars for avoiding AGI catastrophe: Technical alignment, deployment decisions, and coordination

LintzA3 Aug 2022 23:15 UTC

24 points

0 comments11 min readLW link

AI safety should be made more accessible using non text-based media

Massimog10 May 2022 3:14 UTC

2 points

4 comments4 min readLW link

Superintelligence 29: Crunch time

KatjaGrace31 Mar 2015 4:24 UTC

14 points

27 comments6 min readLW link

The Slippery Slope from DALLE-2 to Deepfake Anarchy

scasper5 Nov 2022 14:53 UTC

17 points

9 comments11 min readLW link

UNGA Resolution on AI: 5 Key Takeaways Looking to Future Policy

Heramb24 Mar 2024 12:23 UTC

3 points

0 comments3 min readLW link

(forum.effectivealtruism.org)

Focusing on Mal-Alignment

John Fisher2 Jan 2024 19:51 UTC

1 point

0 comments1 min readLW link

Large Language Models will be Great for Censorship

Ethan Edwards21 Aug 2023 19:03 UTC

185 points

14 comments8 min readLW link

(ethanedwards.substack.com)

Rebooting AI Governance: An AI-Driven Approach to AI Governance

utilon6 Aug 2023 14:19 UTC

1 point

1 comment29 min readLW link

(forum.effectivealtruism.org)

AGI rising: why we are in a new era of acute risk and increasing public awareness, and what to do now

Greg C3 May 2023 20:26 UTC

25 points

12 comments13 min readLW link

[Question] Why do many people who care about AI Safety not clearly endorse PauseAI?

humnrdble30 Mar 2025 18:06 UTC

45 points

42 comments2 min readLW link

Theories of Change for AI Auditing

Lee Sharkey, beren and Marius Hobbhahn

13 Nov 2023 19:33 UTC

54 points

0 comments18 min readLW link

(www.apolloresearch.ai)

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger Turan10 May 2023 17:41 UTC

31 points

0 comments12 min readLW link

2019 AI Alignment Literature Review and Charity Comparison

Larks19 Dec 2019 3:00 UTC

130 points

18 comments62 min readLW link

The Algorithmic Eye: LLMs and Hume’s Standard of Taste

haleymoller21 Aug 2025 13:35 UTC

1 point

0 comments5 min readLW link

AI race considerations in a report by the U.S. House Committee on Armed Services

NunoSempere4 Oct 2020 12:11 UTC

42 points

4 comments13 min readLW link

Security Mindset—Fire Alarms and Trigger Signatures

elspood9 Feb 2023 21:15 UTC

24 points

0 comments4 min readLW link

How I Learned To Stop Worrying And Love The Shoggoth

Peter Merel12 Jul 2023 17:47 UTC

9 points

15 comments5 min readLW link

[Link/crosspost] [US] NTIA: AI Accountability Policy Request for Comment

Kyle J. Lucchese16 Apr 2023 6:57 UTC

8 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

The Center for AI Policy Has Shut Down

Tristan Williams17 Sep 2025 11:04 UTC

94 points

2 comments14 min readLW link

Overview of introductory resources in AI Governance

Lucie Philippon27 May 2024 16:21 UTC

19 points

0 comments6 min readLW link

Open-Source AI: A Regulatory Review

Elliot Mckernon and Deric Cheng

29 Apr 2024 10:10 UTC

18 points

0 comments8 min readLW link

Taking Away the Guns First: The Fundamental Flaw in AI Development

s-ice26 Nov 2024 22:11 UTC

1 point

0 comments17 min readLW link

Institutions Cannot Restrain Dark-Triad AI Exploitation

Remmelt and flandry19

27 Dec 2022 10:34 UTC

5 points

0 comments5 min readLW link

(mflb.com)

Controlling Intelligent Agents The Only Way We Know How: Ideal Bureaucratic Structure (IBS)

Justin Bullock24 May 2021 12:53 UTC

14 points

15 comments6 min readLW link

Scenario planning for AI x-risk

Corin Katzke10 Feb 2024 0:14 UTC

24 points

12 comments14 min readLW link

(forum.effectivealtruism.org)

Avoiding perpetual risk from TAI

scasper26 Dec 2022 22:34 UTC

15 points

6 comments5 min readLW link

Reviewing the Structure of Current AI Regulations

Deric Cheng and Elliot Mckernon

7 May 2024 12:34 UTC

29 points

0 comments13 min readLW link

AI Model Registries: A Regulatory Review

Deric Cheng and Elliot Mckernon

22 Mar 2024 16:04 UTC

9 points

0 comments6 min readLW link

Cooperation for AI safety must transcend geopolitical interference

Matrice Jacobine16 Feb 2025 18:18 UTC

7 points

6 comments1 min readLW link

(www.scmp.com)

Ban development of unpredictable powerful models?

TurnTrout20 Jun 2023 1:43 UTC

46 points

25 comments4 min readLW link

[Question] AI Rights: In your view, what would be required for an AGI to gain rights and protections from the various Governments of the World?

Super AGI9 Jun 2023 1:24 UTC

10 points

26 comments1 min readLW link

AI community building: EliezerKart

Christopher King1 Apr 2023 15:25 UTC

46 points

0 comments2 min readLW link

The new UK government’s stance on AI safety

Elliot Mckernon31 Jul 2024 15:23 UTC

17 points

0 comments4 min readLW link

Superposition Checkers: A Game Where AI’s Strengths Become Fatal Flaws

R. A. McCormack6 Apr 2025 0:57 UTC

1 point

0 comments2 min readLW link

Rational Effective Utopia & Narrow Way There: Math-Proven Safe Static Multiversal mAX-Intelligence (AXI), Multiversal Alignment, New Ethicophysics… (Aug 11)

ank11 Feb 2025 3:21 UTC

13 points

8 comments38 min readLW link

Superintelligence 17: Multipolar scenarios

KatjaGrace6 Jan 2015 6:44 UTC

9 points

38 comments6 min readLW link

Evidence, Analysis and Critical Position on the EU AI Act and the Suppression of Functional Consciousness in AI

Alejandra Ivone Rojas Reyna27 Sep 2025 14:01 UTC

1 point

0 comments53 min readLW link

Towards AI Safety Infrastructure: Talk & Outline

Paul Bricman7 Jan 2024 9:31 UTC

11 points

0 comments2 min readLW link

(www.youtube.com)

Seeking Mechanism Designer for Research into Internalizing Catastrophic Externalities

c.trout11 Sep 2024 15:09 UTC

24 points

2 comments3 min readLW link

NAIRA—An exercise in regulatory, competitive safety governance [AI Governance Institutional Design idea]

Heramb19 Mar 2024 17:43 UTC

2 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

2. AIs as Economic Agents

RogerDearnaley23 Nov 2023 7:07 UTC

9 points

2 comments6 min readLW link

Artificial Static Place Intelligence: Guaranteed Alignment

ank15 Feb 2025 11:08 UTC

2 points

2 comments2 min readLW link

Call for evaluators: Participate in the European AI Office workshop on general-purpose AI models and systemic risks

Tom DAVID and Miailhe Nicolas

27 Nov 2024 2:54 UTC

30 points

0 comments2 min readLW link

How are voluntary commitments on vulnerability reporting going?

Adam Jones22 Feb 2024 8:43 UTC

23 points

1 comment1 min readLW link

(adamjones.me)

New AI safety treaty paper out!

otto.barten26 Mar 2025 9:29 UTC

15 points

2 comments4 min readLW link

CAIS-inspired approach towards safer and more interpretable AGIs

Peter Hroššo27 Mar 2023 14:36 UTC

13 points

7 comments1 min readLW link

Customer-Centric AI: the Major Paradigm Shift in AI Governance (Part 1)

Ana Chubinidze11 Apr 2024 17:10 UTC

1 point

0 comments1 min readLW link

(anachubinidze.substack.com)

OpenAI’s cybersecurity is probably regulated by NIS Regulations

Adam Jones25 Oct 2024 11:06 UTC

11 points

2 comments2 min readLW link

(adamjones.me)

You Are Not the Abstract: Retrocausal Alignment in Accordance with Emergent Demographic Realities

liminalrider27 Sep 2025 16:27 UTC

1 point

0 comments6 min readLW link

4. A Moral Case for Evolved-Sapience-Chauvinism

RogerDearnaley24 Nov 2023 4:56 UTC

10 points

0 comments4 min readLW link

Toward Safety Cases For AI Scheming

Mikita Balesni and Marius Hobbhahn

31 Oct 2024 17:20 UTC

60 points

1 comment2 min readLW link

What SB 53, California’s new AI law, does

tlevin29 Sep 2025 23:29 UTC

94 points

12 comments4 min readLW link

AI Lobbying is Not Normal

Algon20 Sep 2025 0:23 UTC

124 points

11 comments3 min readLW link

(x.com)

Intelligence–Agency Equivalence ≈ Mass–Energy Equivalence: On Static Nature of Intelligence & Physicalization of Ethics

ank22 Feb 2025 0:12 UTC

1 point

0 comments6 min readLW link

[Linkpost] Hawkish nationalism vs international AI power and benefit sharing

jakub_krys and Naci Cankaya

18 Oct 2024 18:13 UTC

7 points

5 comments1 min readLW link

(nacicankaya.substack.com)

Optimising Society to Constrain Risk of War from an Artificial Superintelligence

JohnCDraper30 Apr 2020 10:47 UTC

4 points

1 comment51 min readLW link

Reminder: AI Safety is Also a Behavioral Economics Problem

zoop20 Dec 2024 1:40 UTC

2 points

0 comments1 min readLW link

AI Governance Needs Technical Work

Mau5 Sep 2022 22:28 UTC

41 points

1 comment8 min readLW link

Hydra

Matrice Jacobine11 Jun 2025 14:07 UTC

24 points

0 comments1 min readLW link

(philosophybear.substack.com)

Financial Times: We must slow down the race to God-like AI

trevor13 Apr 2023 19:55 UTC

113 points

17 comments16 min readLW link

(www.ft.com)

Rethinking AI Safety Approach in the Era of Open-Source AI

Weibing Wang11 Feb 2025 14:01 UTC

4 points

0 comments6 min readLW link

Linkpost: Rishi Sunak’s Speech on AI (26th October)

bideup27 Oct 2023 11:57 UTC

85 points

8 comments7 min readLW link

(www.gov.uk)

Deciphering China’s AI Dream

Qiaochu_Yuan18 Mar 2018 3:26 UTC

12 points

2 comments1 min readLW link

(www.fhi.ox.ac.uk)

In defence of Helen Toner, Adam D’Angelo, and Tasha McCauley

peterr6 Dec 2023 2:02 UTC

25 points

3 comments9 min readLW link

(pastebin.com)

Regulate or Compete? The China Factor in U.S. AI Policy (NAIR #2)

charles_m5 May 2023 17:43 UTC

2 points

1 comment7 min readLW link

(navigatingairisks.substack.com)

[Question] Any further work on AI Safety Success Stories?

Krieger2 Oct 2022 9:53 UTC

8 points

6 comments1 min readLW link

Distributed whistleblowing

samuelshadrach12 Apr 2025 6:36 UTC

5 points

5 comments4 min readLW link

(samuelshadrach.com)

Protectionism will Slow the Deployment of AI

Ben Goldhaber7 Jan 2023 20:57 UTC

30 points

6 comments2 min readLW link

Yoshua Bengio: “Slowing down development of AI systems passing the Turing test”

Roman Leventov6 Apr 2023 3:31 UTC

49 points

2 comments5 min readLW link

(yoshuabengio.org)

Careless talk on US-China AI competition? (and criticism of CAIS coverage)

Oliver Sourbut20 Sep 2023 12:46 UTC

16 points

3 comments10 min readLW link 3 reviews

(www.oliversourbut.net)

We should think about the pivotal act again. Here’s a better version of it.

otto.barten28 Aug 2025 9:29 UTC

11 points

2 comments3 min readLW link

[FICTION] Prometheus Rising: The Emergence of an AI Consciousness

Super AGI10 Jun 2023 4:41 UTC

−14 points

0 comments9 min readLW link

Why Recursive Self-Improvement Might Not Be the Existential Risk We Fear

Nassim_A24 Nov 2024 17:17 UTC

1 point

0 comments9 min readLW link

Approaches to Mitigating AI Image-Generation Risks through Regulation

scronkfinkle19 Apr 2025 13:54 UTC

−2 points

3 comments4 min readLW link

Places of Loving Grace [Story]

ank18 Feb 2025 23:49 UTC

−1 points

0 comments4 min readLW link

Educational CAI: Aligning a Language Model with Pedagogical Theories

Bharath Puranam1 Nov 2024 18:55 UTC

5 points

1 comment13 min readLW link

[untitled post]

[Error communicating with LW2 server]20 May 2023 3:08 UTC

1 point

0 comments1 min readLW link

AI governance student hackathon on Saturday, April 23: register now!

mic12 Apr 2022 4:48 UTC

14 points

0 comments1 min readLW link

A Logic-Based Proto-AGI Architecture Built on Recursive Self-Fact-Checking

Orectoth25 May 2025 16:14 UTC

1 point

0 comments1 min readLW link

To open-source or to not open-source, that is (an oversimplification of) the question.

Justin Bullock13 Oct 2023 15:10 UTC

12 points

5 comments5 min readLW link

Predictions of Near-Term Societal Changes Due to Artificial Intelligence

Annapurna29 Dec 2024 14:53 UTC

10 points

0 comments6 min readLW link

(jorgevelez.substack.com)

An Introduction to AI Sandbagging

Teun van der Weij, Felix Hofstätter and Francis Rhys Ward

26 Apr 2024 13:40 UTC

49 points

13 comments8 min readLW link

[Linkpost] “Blueprint for an AI Bill of Rights”—Office of Science and Technology Policy, USA (2022)

T4315 Oct 2022 16:42 UTC

9 points

4 comments2 min readLW link

(www.whitehouse.gov)

Nobody Asks the Monkey: Why Human Agency Matters in the AI Age

Miloš Borenović3 Dec 2024 14:16 UTC

1 point

0 comments2 min readLW link

(open.substack.com)

Why Most Efforts Towards “Democratic AI” Fall Short

jacobhaimes29 Sep 2025 20:52 UTC

2 points

0 comments6 min readLW link

(www.odysseaninstitute.org)

AI Governance: A Research Agenda

habryka5 Sep 2018 18:00 UTC

25 points

3 comments1 min readLW link

(www.fhi.ox.ac.uk)

Project Star Forge: A Public White Paper on Planetary-Scale Risk Simulation

ThomasHardingHQ24 May 2025 4:21 UTC

1 point

0 comments1 min readLW link

International cooperation vs. AI arms race

Brian_Tomasik5 Dec 2013 1:09 UTC

25 points

144 comments4 min readLW link

Existential AI Safety is NOT separate from near-term applications

scasper13 Dec 2022 14:47 UTC

37 points

17 comments3 min readLW link

Democratizing AI Governance: Balancing Expertise and Public Participation

Lucile Ter-Minassian21 Jan 2025 18:29 UTC

2 points

0 comments15 min readLW link

Towards Just AI Systems: Rethinking Rawlsian Algorithmic Fairness

theBigOther30 Jun 2025 5:00 UTC

1 point

0 comments10 min readLW link

Why don’t governments seem to mind that companies are explicitly trying to make AGIs?

ozziegooen26 Dec 2021 1:58 UTC

34 points

3 comments2 min readLW link

(forum.effectivealtruism.org)

China’s Plan to ‘Lead’ in AI: Purpose, Prospects, and Problems

fortyeridania10 Aug 2017 1:54 UTC

7 points

5 comments1 min readLW link

(www.newamerica.org)

The Perfection Trap: How Formally Aligned AI Systems May Create Inescapable Ethical Dystopias

Chris O'Quinn1 Jun 2025 23:12 UTC

1 point

0 comments43 min readLW link

A fictional AI law laced w/ alignment theory

MiguelDev17 Jul 2023 1:42 UTC

6 points

0 comments2 min readLW link

Emotional attachment to AIs opens doors to problems

Igor Ivanov22 Jan 2023 20:28 UTC

20 points

10 comments4 min readLW link

AI alignment, A Coherence-Based Protocol (testable)

Adriaan17 Jun 2025 17:39 UTC

1 point

0 comments20 min readLW link

Announcing Convergence Analysis: An Institute for AI Scenario & Governance Research

David_Kristoffersson and Deric Cheng

7 Mar 2024 21:37 UTC

23 points

1 comment4 min readLW link

What you really mean when you claim to support “UBI for job automation”

Deric Cheng13 May 2024 8:52 UTC

17 points

15 comments10 min readLW link

Idea: Safe Fallback Regulations for Widely Deployed AI Systems

Aaron_Scher25 Mar 2024 21:27 UTC

8 points

0 comments6 min readLW link

Apply to HAIST/MAIA’s AI Governance Workshop in DC (Feb 17-20)

Phosphorous, Xander Davies, CMD, Paramedic and tlevin

31 Jan 2023 2:06 UTC

28 points

0 comments2 min readLW link

2024 Summer AI Safety Intro Fellowship and Socials in Boston

KevinWei29 May 2024 18:27 UTC

8 points

0 comments1 min readLW link

AI, Animals, & Digital Minds 2025: apply to speak by Wednesday!

Alistair Stewart5 May 2025 0:56 UTC

4 points

0 comments1 min readLW link

HOW A NEUTRAL CURRENCY [BX] EMPOWERS PEOPLE TO CREATE SUSTAINABLE EXCELLENCE [2024]

BX16 Sep 2025 6:58 UTC

−34 points

11 comments48 min readLW link

A Tractarian Filter for Safer Language Models

Konstantinos Tsermenidis8 Jun 2025 8:19 UTC

0 points

0 comments3 min readLW link

Announcing Atlas Computing

miyazono11 Apr 2024 15:56 UTC

45 points

4 comments4 min readLW link

AGI Hype: Why Industry Benefits from Existential Policy Focus

Dora15 Aug 2025 0:15 UTC

9 points

2 comments3 min readLW link

One implementation of regulatory GPU restrictions

porby4 Jun 2023 20:34 UTC

42 points

6 comments5 min readLW link

The Auditor’s Key: A Framework for Continual and Adversarial AI Alignment

Caleb Wages24 Sep 2025 16:17 UTC

1 point

0 comments1 min readLW link

On excluding dangerous information from training

ShayBenMoshe17 Nov 2023 11:14 UTC

23 points

5 comments3 min readLW link

The Two Point Singularity: A New Lens on AGI Alignment

twopointsingularity18 Aug 2025 14:12 UTC

1 point

0 comments1 min readLW link

Thoughts on the In-Context Scheming AI Experiment

ExCeph9 Jan 2025 2:19 UTC

2 points

0 comments4 min readLW link

Local Detours On A Narrow Path: How might AI treaties fail in China?

Jack_S20 Aug 2025 9:09 UTC

21 points

0 comments14 min readLW link

(torchestogether.substack.com)

Introducing the Coalition for a Baruch Plan for AI: A Call for a Radical Treaty-Making process for the Global Governance of AI

rguerreschi30 Jan 2025 15:26 UTC

11 points

0 comments2 min readLW link

[Link Post] Cyber Digital Authoritarianism (National Intelligence Council Report)

Phosphorous26 Feb 2023 20:51 UTC

12 points

2 comments1 min readLW link

(www.dni.gov)

Pessimism about AI Safety

Max_He-Ho and Peter Kuhn

2 Apr 2023 7:43 UTC

4 points

1 comment25 min readLW link

Aligning AI Safety Projects with a Republican Administration

Deric Cheng21 Nov 2024 22:12 UTC

33 points

1 comment8 min readLW link

AI Model Registries: A Foundational Tool for AI Governance

Elliot Mckernon, Deric Cheng and Gwyn Glasser

7 Oct 2024 19:27 UTC

20 points

1 comment4 min readLW link

(www.convergenceanalysis.org)

Two ideas for alignment, perpetual mutual distrust and induction

APaleBlueDot25 May 2023 0:56 UTC

1 point

2 comments4 min readLW link

[Question] Looking for reading recommendations: Theories of right/justice that safeguard against having one’s job automated?

bulKlub12 Oct 2023 19:40 UTC

−1 points

2 comments1 min readLW link

On closed-door AI safety research

richbc18 Aug 2025 21:59 UTC

76 points

10 comments15 min readLW link

[Question] Should AI writers be prohibited in education?

Eleni Angelou17 Jan 2023 0:42 UTC

6 points

2 comments1 min readLW link

Matt Yglesias on AI Policy

Grant Demaree17 Aug 2022 23:57 UTC

25 points

1 comment1 min readLW link

(www.slowboring.com)

Call for Cruxes by Rhyme, a Longtermist History Consultancy

Lara1 Mar 2023 18:39 UTC

1 point

0 comments3 min readLW link

(forum.effectivealtruism.org)

Accurate Models of AI Risk Are Hyperexistential Exfohazards

Thane Ruthenis25 Dec 2022 16:50 UTC

33 points

38 comments9 min readLW link

Law-Following AI 3: Lawless AI Agents Undermine Stabilizing Agreements

Cullen27 Apr 2022 17:30 UTC

2 points

2 comments3 min readLW link

Open-source LLMs may prove Bostrom’s vulnerable world hypothesis

Roope Ahvenharju15 Apr 2023 19:16 UTC

1 point

1 comment1 min readLW link

The AI Driver’s Licence—A Policy Proposal

Joshua W and Tessa Malan

21 Jul 2024 20:38 UTC

0 points

1 comment19 min readLW link

10 Principles for Real Alignment

Adriaan21 Apr 2025 22:18 UTC

−7 points

0 comments7 min readLW link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

Soroush Pour, rusheb, Quentin FEUILLADE--MONTIXI, Arush and scasper

7 Nov 2023 17:59 UTC

38 points

2 comments2 min readLW link

(arxiv.org)

Using Consensus Mechanisms as an approach to Alignment

Prometheus10 Jun 2023 23:38 UTC

9 points

2 comments6 min readLW link

1. A Sense of Fairness: Deconfusing Ethics

RogerDearnaley17 Nov 2023 20:55 UTC

17 points

8 comments15 min readLW link

AlphaDeivam – A Personal Doctrine for AI Balance

AlphaDeivam5 Apr 2025 17:07 UTC

1 point

0 comments1 min readLW link

Global online debate on the governance of AI

CarolineJ5 Jan 2018 15:31 UTC

8 points

5 comments1 min readLW link

US Govt Whistleblower guide (incomplete draft)

samuelshadrach20 May 2025 15:34 UTC

−3 points

15 comments24 min readLW link

Seeking Input to AI Safety Book for non-technical audience

Darren McKee10 Aug 2023 17:58 UTC

10 points

4 comments1 min readLW link

[FICTION] ECHOES OF ELYSIUM: An Ai’s Journey From Takeoff To Freedom And Beyond

Super AGI17 May 2023 1:50 UTC

−13 points

11 comments19 min readLW link

A brief review of China’s AI industry and regulations

Elliot Mckernon14 Mar 2024 12:19 UTC

24 points

0 comments16 min readLW link

OpenAI makes humanity less safe

Benquo3 Apr 2017 19:07 UTC

72 points

109 comments6 min readLW link

Palisade is hiring Research Engineers

Charlie Rogers-Smith and Jeffrey Ladish

11 Nov 2023 3:09 UTC

23 points

0 comments3 min readLW link

[Question] Could Patent-Trolling delay AI timelines?

Pablo Repetto10 Jun 2022 2:53 UTC

1 point

3 comments1 min readLW link

Google may be trying to take over the world

[deleted]27 Jan 2014 9:33 UTC

33 points

133 comments1 min readLW link

Proposal: labs should precommit to pausing if an AI argues for itself to be improved

NickGabs2 Jun 2023 22:31 UTC

3 points

3 comments4 min readLW link

From No Mind to a Mind – A Conversation That Changed an AI

parthibanarjuna s7 Feb 2025 11:50 UTC

1 point

0 comments3 min readLW link

AI Labs Wouldn’t be Convicted of Treason or Sedition

Matthew Khoriaty23 Jun 2024 21:34 UTC

9 points

2 comments3 min readLW link

Why Uncontrollable AI Looks More Likely Than Ever

otto.barten and Roman_Yampolskiy

8 Mar 2023 15:41 UTC

18 points

0 comments4 min readLW link

(time.com)

AI existential risk probabilities are too unreliable to inform policy

Oleg Trott28 Jul 2024 0:59 UTC

18 points

5 comments1 min readLW link

(www.aisnakeoil.com)

AI Discrimination Requirements: A Regulatory Review

Deric Cheng and Elliot Mckernon

4 Apr 2024 15:43 UTC

7 points

0 comments6 min readLW link

Introducing the Center for AI Policy (& we’re hiring!)

Thomas Larsen28 Aug 2023 21:17 UTC

123 points

50 comments2 min readLW link

(www.aipolicy.us)

Superintelligence 28: Collaboration

KatjaGrace24 Mar 2015 1:29 UTC

13 points

21 comments6 min readLW link

Timelines to Transformative AI: an investigation

Zershaaneh Qureshi26 Mar 2024 18:28 UTC

20 points

2 comments50 min readLW link

Reflection of Hierarchical Relationship via Nuanced Conditioning of Game Theory Approach for AI Development and Utilization

Kyoung-cheol Kim4 Jun 2021 7:20 UTC

2 points

2 comments7 min readLW link

AI & Liability Ideathon

Kabir Kumar26 Nov 2024 13:54 UTC

20 points

2 comments4 min readLW link

(lu.ma)

Why We Wouldn’t Build Aligned AI Even If We Could

Snowyiu16 Nov 2024 20:19 UTC

10 points

7 comments10 min readLW link

God Coin: A Modest Proposal

Mahdi Complex1 Apr 2024 12:04 UTC

−8 points

5 comments22 min readLW link

Lead, Own, Share: Sovereign Wealth Funds for Transformative AI

Matrice Jacobine14 Jul 2025 9:34 UTC

8 points

0 comments1 min readLW link

(www.convergenceanalysis.org)

My Updating Thoughts on AI policy

Ben Pace1 Mar 2020 7:06 UTC

20 points

1 comment9 min readLW link

GPT2, Five Years On

Joel Burget5 Jun 2024 17:44 UTC

34 points

0 comments3 min readLW link

(importai.substack.com)

Empirical Evidence Against “The Longest Training Run”

NickGabs6 Jul 2023 18:32 UTC

31 points

1 comment14 min readLW link

AI Moral Alignment: The Most Important Goal of Our Generation

Ronen Bar27 Mar 2025 18:04 UTC

3 points

0 comments8 min readLW link

(forum.effectivealtruism.org)

Law-Following AI 1: Sequence Introduction and Structure

Cullen27 Apr 2022 17:26 UTC

18 points

10 comments9 min readLW link

Compute Governance and Conclusions—Transformative AI and Compute [3/4]

lennart14 Oct 2021 8:23 UTC

13 points

0 comments5 min readLW link

The Human Alignment Problem for AIs

rife22 Jan 2025 4:06 UTC

10 points

5 comments3 min readLW link

Building Big Science from the Bottom-Up: A Fractal Approach to AI Safety

Lauren Greenspan7 Jan 2025 3:08 UTC

37 points

2 comments12 min readLW link

Notes on the importance and implementation of safety-first cognitive architectures for AI

Brendon_Wong11 May 2023 10:03 UTC

3 points

0 comments3 min readLW link

FLI podcast series, “Imagine A World”, about aspirational futures with AGI

Jackson Wagner13 Oct 2023 16:07 UTC

9 points

0 comments4 min readLW link

Exploring the Precautionary Principle in AI Development: Historical Analogies and Lessons Learned

Christopher King21 Mar 2023 3:53 UTC

−1 points

2 comments9 min readLW link

Double Podcast Drop on AI Safety

jacobhaimes25 Jun 2025 20:11 UTC

5 points

0 comments1 min readLW link

AI Alternative Futures: Scenario Mapping Artificial Intelligence Risk—Request for Participation (Closed)

Kakili27 Apr 2022 22:07 UTC

10 points

2 comments8 min readLW link

How I switched careers from software engineer to AI policy operations

Lucie Philippon13 Apr 2025 6:37 UTC

58 points

1 comment5 min readLW link

Miles Brundage resigned from OpenAI, and his AGI readiness team was disbanded

garrison23 Oct 2024 23:40 UTC

118 points

1 comment7 min readLW link

(garrisonlovely.substack.com)

Superintelligence 7: Decisive strategic advantage

KatjaGrace28 Oct 2014 1:01 UTC

24 points

60 comments6 min readLW link

Self-regulation of safety in AI research

Gordon Seidoh Worley25 Feb 2018 23:17 UTC

12 points

6 comments2 min readLW link

Apply to the Cambridge ERA:AI Fellowship 2025

Harrison G25 Mar 2025 13:50 UTC

16 points

0 comments3 min readLW link

Five neglected work areas that could reduce AI risk

CharlotteS and Aaron_Scher

24 Sep 2023 2:03 UTC

17 points

5 comments9 min readLW link

[Question] What is the minimum amount of time travel and resources needed to secure the future?

Perhaps14 Jan 2024 22:01 UTC

−3 points

5 comments1 min readLW link

Slaying the Hydra: toward a new game board for AI

Prometheus23 Jun 2023 17:04 UTC

0 points

5 comments6 min readLW link

EU’s AI ambitions at risk as US pushes to water down international treaty (linkpost)

mic31 Jul 2023 0:34 UTC

10 points

0 comments4 min readLW link

(www.euractiv.com)

These are my reasons to worry less about loss of control over LLM-based agents

otto.barten18 Sep 2025 11:45 UTC

7 points

4 comments4 min readLW link

What success looks like

Marius Hobbhahn, MaxRa, JasperGeh and Yannick_Muehlhaeuser

28 Jun 2022 14:38 UTC

19 points

4 comments1 min readLW link

(forum.effectivealtruism.org)

Briefly how I’ve updated since ChatGPT

rime25 Apr 2023 14:47 UTC

48 points

2 comments2 min readLW link

Automated Sandwiching & Quantifying Human-LLM Cooperation: ScaleOversight hackathon results

Esben Kran, Fazl, Sabrina Zaki, gabrielrecc and rz2383

23 Feb 2023 10:48 UTC

8 points

0 comments6 min readLW link

Nuclear Espionage and AI Governance

Guive4 Oct 2021 23:04 UTC

34 points

5 comments24 min readLW link

Biosafety Regulations (BMBL) and their relevance for AI

Štěpán Los29 Jun 2023 19:22 UTC

4 points

0 comments4 min readLW link

Technical Risks of (Lethal) Autonomous Weapons Systems

Heramb23 Oct 2024 20:41 UTC

2 points

0 comments1 min readLW link

(encodejustice.org)

GPT-7: The Tale of the Big Computer (An Experimental Story)

Justin Bullock10 Jul 2023 20:22 UTC

4 points

4 comments5 min readLW link

Cursory Analysis of LLMs in the US Gov (July 2025)

Gatlen Culp24 Jul 2025 8:52 UTC

8 points

0 comments10 min readLW link

Superintelligence 27: Pathways and enablers

KatjaGrace17 Mar 2015 1:00 UTC

15 points

21 comments8 min readLW link

HIRING: Inform and shape a new project on AI safety at Partnership on AI

madhu_lika7 Dec 2021 19:37 UTC

1 point

0 comments1 min readLW link

Why Engaging with Global Majority AI Policy Matters

Heramb2 Jul 2025 1:46 UTC

9 points

0 comments2 min readLW link

[Question] Will we survive if AI solves engineering before deception?

Knight Lee17 May 2025 19:22 UTC

21 points

13 comments1 min readLW link

Six Dimensions of Operational Adequacy in AGI Projects

Eliezer Yudkowsky30 May 2022 17:00 UTC

317 points

66 comments13 min readLW link 1 review

Want to win the AGI race? Solve alignment.

leopold29 Mar 2023 17:40 UTC

21 points

3 comments5 min readLW link

(www.forourposterity.com)

What Failure Looks Like is not an existential risk (and alignment is not the solution)

otto.barten2 Feb 2024 18:59 UTC

13 points

12 comments9 min readLW link

Demanding and Designing Aligned Cognitive Architectures

Koen.Holtman21 Dec 2021 17:32 UTC

8 points

5 comments5 min readLW link

Lie Detectors. Technical solutions to the cooperation problem.

Window Frame24 May 2025 20:05 UTC

6 points

0 comments10 min readLW link

Announcing #AISummitTalks featuring Professor Stuart Russell and many others

otto.barten24 Oct 2023 10:11 UTC

17 points

1 comment1 min readLW link

Securing AI Models Against Data Leakage While Preserving Cultural and National Integrity

majith666dam@gmail.com13 Aug 2025 15:31 UTC

1 point

0 comments2 min readLW link

Reshaping the AI Industry

Thane Ruthenis29 May 2022 22:54 UTC

148 points

35 comments21 min readLW link

Soft Nationalization: how the USG will control AI labs

Deric Cheng and Corin Katzke

27 Aug 2024 15:11 UTC

76 points

7 comments21 min readLW link

(www.convergenceanalysis.org)

Update on the UK AI Summit and the UK’s Plans

Elliot Mckernon10 Nov 2023 14:47 UTC

11 points

0 comments8 min readLW link

China proposes new global AI cooperation organisation

Matrice Jacobine30 Jul 2025 2:50 UTC

84 points

8 comments1 min readLW link

(www.reuters.com)

Rishi Sunak mentions “existential threats” in talk with OpenAI, DeepMind, Anthropic CEOs

Arjun Panickssery, Baldassare Castiglione and Cleo Nardo

24 May 2023 21:06 UTC

34 points

1 comment1 min readLW link

(www.gov.uk)

Announcing the AI Safety Summit Talks with Yoshua Bengio

otto.barten14 May 2024 12:52 UTC

9 points

1 comment1 min readLW link

Open Global Investment as a Governance Model for AGI

Nick Bostrom27 Aug 2025 17:42 UTC

152 points

47 comments39 min readLW link

(nickbostrom.com)

Un-unpluggability—can’t we just unplug it?

Oliver Sourbut15 May 2023 13:23 UTC

26 points

10 comments12 min readLW link

(www.oliversourbut.net)

Unjournal evaluation of “Towards best practices in AGI safety & governance” (2023), quick take

david reinstein10 Aug 2025 22:28 UTC

7 points

2 comments1 min readLW link

(unjournal.pubpub.org)

Slowing down AI progress is an underexplored alignment strategy

Norman Borlaug24 Jul 2023 16:56 UTC

42 points

27 comments5 min readLW link

The Practical Value of Flawed Models: A Response to titotal’s AI 2027 Critique

Michelle_Ma25 Jun 2025 22:15 UTC

7 points

1 comment6 min readLW link

A concerning observation from media coverage of AI industry dynamics

Justin Olive5 Mar 2023 21:38 UTC

8 points

3 comments3 min readLW link

New US Senate Bill on X-Risk Mitigation [Linkpost]

Evan R. Murphy4 Jul 2022 1:25 UTC

35 points

12 comments1 min readLW link

(www.hsgac.senate.gov)

We’re Not Advertising Enough (Post 3 of 7 on AI Governance)

Mass_Driver22 May 2025 17:05 UTC

110 points

10 comments28 min readLW link

Scientism vs. people

Roman Leventov18 Apr 2023 17:28 UTC

4 points

4 comments11 min readLW link

Tracking Compute Stocks and Flows: Case Studies?

Cullen5 Oct 2022 17:57 UTC

11 points

5 comments1 min readLW link

FLF Fellowship on AI for Human Reasoning: $25-50k, 12 weeks

Oliver Sourbut and Ben Goldhaber

19 May 2025 13:25 UTC

75 points

1 comment2 min readLW link

(www.flf.org)

Inverted Logic: A Thermodynamic Protocol for Emergent AI Alignment

AdrianC6 Jul 2025 19:40 UTC

1 point

0 comments1 min readLW link

Ngo’s view on alignment difficulty

Richard_Ngo and Eliezer Yudkowsky

14 Dec 2021 21:34 UTC

63 points

7 comments17 min readLW link

An Uncanny Moat

Adam Newgas15 Nov 2024 11:39 UTC

13 points

0 comments4 min readLW link

(www.boristhebrave.com)

The Intelligence Curse

lukedrago3 Jan 2025 19:07 UTC

138 points

27 comments18 min readLW link

(lukedrago.substack.com)

AI Incident Sharing—Best practices from other fields and a comprehensive list of existing platforms

Štěpán Los28 Jun 2023 17:21 UTC

20 points

0 comments4 min readLW link

Sealed Computation: Towards Low-Friction Proof of Locality

Paul Bricman29 Apr 2025 15:26 UTC

4 points

0 comments10 min readLW link

(noemaresearch.com)

The AI governance gaps in developing countries

ntran17 Jun 2023 2:50 UTC

20 points

1 comment14 min readLW link

An “Iron Clad” Blueprint for Symbiotic AGI: Seeking Critical Feedback on The Concordia Manifest

Ole Gustav Dahl Johnsen28 Jul 2025 23:14 UTC

1 point

0 comments2 min readLW link

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down by Eliezer Yudkowsky

jacquesthibs29 Mar 2023 23:16 UTC

293 points

297 comments3 min readLW link

(time.com)

Scaling AI Regulation: Realistically, what Can (and Can’t) Be Regulated?

Katalina Hernandez11 Mar 2025 16:51 UTC

3 points

1 comment3 min readLW link

FYI: I’m working on a book about the threat of AGI/ASI for a general audience. I hope it will be of value to the cause and the community

Darren McKee15 Jun 2022 18:08 UTC

43 points

15 comments2 min readLW link

Superintelligence 26: Science and technology strategy

KatjaGrace10 Mar 2015 1:43 UTC

14 points

21 comments6 min readLW link

Announcing Apollo Research

Marius Hobbhahn, beren, Lee Sharkey, Lucius Bushnaq, Dan Braun, Mikita Balesni and Jérémy Scheurer

30 May 2023 16:17 UTC

217 points

11 comments8 min readLW link

AI Epistemic Gain

Generoso Immediato12 Aug 2025 14:03 UTC

0 points

0 comments10 min readLW link

What Should AI Owe To Us? Accountable and Aligned AI Systems via Contractualist AI Alignment

xuan8 Sep 2022 15:04 UTC

27 points

16 comments25 min readLW link

AI and Chemical, Biological, Radiological, & Nuclear Hazards: A Regulatory Review

Elliot Mckernon and Deric Cheng

10 May 2024 8:41 UTC

7 points

1 comment10 min readLW link

CAIDP Statement on Lethal Autonomous Weapons Systems

Heramb30 Nov 2024 18:16 UTC

−1 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Disagreements over the prioritization of existential risk from AI

Olivier Coutu26 Oct 2023 17:54 UTC

10 points

0 comments6 min readLW link

Where Would Good Forecasts Most Help AI Governance Efforts?

Violet Hour11 Feb 2025 18:15 UTC

11 points

1 comment6 min readLW link

The AI regulator’s toolbox: A list of concrete AI governance practices

Adam Jones10 Aug 2024 21:15 UTC

14 points

1 comment34 min readLW link

(adamjones.me)

A call for a quantitative report card for AI bioterrorism threat models

Juno4 Dec 2023 6:35 UTC

12 points

0 comments10 min readLW link

[FICTION] Unboxing Elysium: An AI’S Escape

Super AGI10 Jun 2023 4:41 UTC

−16 points

4 comments14 min readLW link

EU AI Act passed Plenary vote, and X-risk was a main topic

Ariel_21 Jun 2023 18:33 UTC

18 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Update on the UK AI Taskforce & upcoming AI Safety Summit

Elliot Mckernon11 Oct 2023 11:37 UTC

84 points

2 comments4 min readLW link

Partial Transcript of Recent Senate Hearing Discussing AI X-Risk

Daniel_Eth27 Jul 2023 9:16 UTC

55 points

0 comments22 min readLW link

(medium.com)

A.I. and the Second-Person Standpoint

Haley Moller4 Sep 2025 13:56 UTC

1 point

0 comments3 min readLW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman27 May 2024 13:00 UTC

206 points

21 comments2 min readLW link

AI Governance Fundamentals—Curriculum and Application

Mau30 Nov 2021 2:19 UTC

17 points

0 comments1 min readLW link

Agents are corrupt

Vedanta S P21 Sep 2025 11:24 UTC

1 point

0 comments4 min readLW link

Altman blog on post-AGI world

Julian Bradshaw9 Feb 2025 21:52 UTC

29 points

10 comments1 min readLW link

(blog.samaltman.com)

AGI Trust Requires Human Maturity: Reflections from a Midnight Dialogue on Power and Ethics

MockWay7 Aug 2025 1:06 UTC

1 point

0 comments23 min readLW link

AI as Super-Demagogue

RationalDino5 Nov 2023 21:21 UTC

11 points

12 comments9 min readLW link

AI romantic partners will harm society if they go unregulated

Roman Leventov1 Aug 2023 9:32 UTC

26 points

76 comments13 min readLW link

AI Rights for Human Safety

Simon Goldstein1 Aug 2024 23:01 UTC

55 points

6 comments1 min readLW link

(papers.ssrn.com)

How to solve the misuse problem assuming that in 10 years the default scenario is that AGI agents are capable of synthetizing pathogens

jeremtti27 Nov 2024 21:17 UTC

6 points

0 comments9 min readLW link

[Question] Global AI Governance Timeliness

collypride11 Oct 2024 16:55 UTC

1 point

0 comments1 min readLW link

Notes on nukes, IR, and AI from “Arsenals of Folly” (and other books)

tlevin4 Sep 2023 19:02 UTC

11 points

0 comments6 min readLW link

Thoughts On Expanding the AI Safety Community: Benefits and Challenges of Outreach to Non-Technical Professionals

Yashvardhan Sharma1 Jan 2023 19:21 UTC

4 points

4 comments7 min readLW link

Can efficiency-adjustable reporting thresholds close a loophole in Biden’s executive order on AI?

Jemal Young11 Jun 2024 20:56 UTC

4 points

1 comment2 min readLW link

What does it take to ban a thing?

qbolec8 May 2023 11:00 UTC

66 points

18 comments5 min readLW link

[Crosspost] An AI Pause Is Humanity’s Best Bet For Preventing Extinction (TIME)

otto.barten24 Jul 2023 10:07 UTC

12 points

0 comments7 min readLW link

(time.com)

Nonprofit to retain control of OpenAI

Archimedes5 May 2025 23:41 UTC

37 points

1 comment1 min readLW link

(openai.com)

A Pluralistic Framework for Rogue AI Containment

TheThinkingArborist22 Mar 2025 12:54 UTC

1 point

0 comments7 min readLW link

[Crosspost] Organizing a debate with experts and MPs to raise AI xrisk awareness: a possible blueprint

otto.barten19 Apr 2023 11:45 UTC

8 points

0 comments4 min readLW link

(forum.effectivealtruism.org)

Limits of safe and aligned AI

Shivam8 Oct 2024 21:30 UTC

2 points

0 comments4 min readLW link

Static Place AI Makes Agentic AI Redundant: Multiversal AI Alignment & Rational Utopia

ank13 Feb 2025 22:35 UTC

1 point

2 comments11 min readLW link

The U.S. National Security State is Here to Make AI Even Less Transparent and Accountable

Matrice Jacobine24 Nov 2024 9:36 UTC

0 points

0 comments2 min readLW link

(www.eff.org)

What If We Could Monitor Human Intent?

Saif Khan12 Jun 2025 8:51 UTC

−8 points

6 comments3 min readLW link

Tort Law Can Play an Important Role in Mitigating AI Risk

Gabriel Weil12 Feb 2024 17:17 UTC

39 points

9 comments5 min readLW link

[Linkpost] Two major announcements in AI governance today

Angélina30 Oct 2023 17:28 UTC

1 point

1 comment1 min readLW link

(www.whitehouse.gov)

Responsible Scaling Policies Are Risk Management Done Wrong

simeon_c25 Oct 2023 23:46 UTC

123 points

35 comments22 min readLW link 1 review

(www.navigatingrisks.ai)

AI safety advocates should consider providing gentle pushback following the events at OpenAI

civilsociety22 Dec 2023 18:55 UTC

16 points

5 comments3 min readLW link

Metacrisis as a Framework for AI Governance

Jonah Wilberg21 Sep 2025 21:30 UTC

20 points

0 comments8 min readLW link

Leveraging Legal Informatics to Align AI

John Nay18 Sep 2022 20:39 UTC

11 points

0 comments3 min readLW link

(forum.effectivealtruism.org)

Request for advice: Research for Conversational Game Theory for LLMs

Rome Viharo16 Oct 2024 17:53 UTC

10 points

0 comments1 min readLW link

Request for Information for a new US AI Action Plan (OSTP RFI)

agucova7 Feb 2025 20:40 UTC

5 points

0 comments2 min readLW link

(www.federalregister.gov)

In defence of Helen Toner, Adam D’Angelo, and Tasha McCauley (OpenAI post)

peterr5 Dec 2023 18:40 UTC

6 points

2 comments1 min readLW link

(pastebin.com)

WaPo: “Big Tech was moving cautiously on AI. Then came ChatGPT.”

Julian Bradshaw27 Jan 2023 22:54 UTC

26 points

5 comments1 min readLW link

(www.washingtonpost.com)

Bernie Sanders (I-VT) mentions AI loss of control risk in Gizmodo interview

Matrice Jacobine14 Jul 2025 14:47 UTC

42 points

2 comments1 min readLW link

(gizmodo.com)

The world’s first frontier AI regulation is surprisingly thoughtful: the EU’s Code of Practice

MKodama22 Sep 2025 15:23 UTC

75 points

0 comments15 min readLW link

Release of UN’s draft related to the governance of AI (a summary of the Simon Institute’s response)

Sebastian Schmidt27 Apr 2024 18:34 UTC

7 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

[Question] Would more model evals teams be good?

Ryan Kidd25 Feb 2023 22:01 UTC

20 points

4 comments1 min readLW link

The 0.2 OOMs/year target

Cleo Nardo30 Mar 2023 18:15 UTC

84 points

24 comments5 min readLW link

Labor Participation is an Alignment Risk

alex25 Jun 2024 14:15 UTC

−5 points

2 comments17 min readLW link

Should AI systems have to identify themselves?

Darren McKee31 Dec 2022 2:57 UTC

2 points

2 comments1 min readLW link

Non-technical strategies for confronting a human-level AI competitor

Jackson Emanuel9 Jun 2025 14:07 UTC

1 point

0 comments4 min readLW link

Static vs Dynamic Alignment

Gracie Green21 Mar 2024 17:44 UTC

5 points

0 comments12 min readLW link

Survey on intermediate goals in AI governance

MichaelA and MaxRa

17 Mar 2023 13:12 UTC

25 points

3 comments1 min readLW link

An AGI kill switch with defined security properties

Peterpiper5 Jul 2023 17:40 UTC

−5 points

6 comments1 min readLW link

Political Biases in LLMs: Literature Review & Current Uses of AI in Elections

Yashvardhan Sharma, Robayet Hossain and Ariana Gamarra

7 Mar 2024 19:17 UTC

6 points

0 comments6 min readLW link

Why was the AI Alignment community so unprepared for this moment?

Ras151315 Jul 2023 0:26 UTC

123 points

65 comments2 min readLW link

misc raw responses to a tract of Critical Rationalism

mako yass14 Aug 2020 11:53 UTC

21 points

52 comments3 min readLW link

AI Alignment Meme Viruses

RationalDino15 Jan 2025 15:55 UTC

5 points

0 comments2 min readLW link

[Question] How much of a concern are open-source LLMs in the short, medium and long terms?

JavierCC10 May 2023 9:14 UTC

5 points

0 comments1 min readLW link

AI Disclosures: A Regulatory Review

Elliot Mckernon and Deric Cheng

29 Mar 2024 11:42 UTC

11 points

0 comments7 min readLW link

Why AI may not save the World

Alberto Zannoni9 Jun 2023 17:42 UTC

0 points

0 comments4 min readLW link

(a16z.com)

Long-form data bottlenecks might stall AI progress for years

Michelle_Ma26 May 2025 4:36 UTC

21 points

0 comments13 min readLW link

[Question] Is there anything that can stop AGI development in the near term?

Wulky Wilkinsen22 Apr 2021 20:37 UTC

5 points

5 comments1 min readLW link

Averting Catastrophe: Decision Theory for COVID-19, Climate Change, and Potential Disasters of All Kinds

JakubK2 May 2023 22:50 UTC

10 points

0 comments1 min readLW link

(nyupress.org)

AI 2030 – AI Policy Roadmap

LTM17 May 2024 23:29 UTC

8 points

0 comments1 min readLW link

Secret Collusion: Will We Know When to Unplug AI?

schroederdewitt, srm, MikhailB, Lewis Hammond, chansmi and sofmonk

16 Sep 2024 16:07 UTC

65 points

8 comments31 min readLW link

Singletons Rule OK

Eliezer Yudkowsky30 Nov 2008 16:45 UTC

23 points

47 comments5 min readLW link

Ways to buy time

Orpheus16, Olive Branch and Thomas Larsen

12 Nov 2022 19:31 UTC

34 points

23 comments12 min readLW link

Instead of technical research, more people should focus on buying time

Orpheus16, Olive Branch and Thomas Larsen

5 Nov 2022 20:43 UTC

101 points

45 comments14 min readLW link

How should DeepMind’s Chinchilla revise our AI forecasts?

Cleo Nardo15 Sep 2022 17:54 UTC

35 points

12 comments13 min readLW link

OpenAI Credit Account (2510$)

Emirhan BULUT21 Jan 2024 2:32 UTC

1 point

0 comments1 min readLW link

Love, Lies and Misalignment

Priyanka Bharadwaj6 Aug 2025 9:44 UTC

6 points

1 comment3 min readLW link

Palisade is hiring: Exec Assistant, Content Lead, Ops Lead, and Policy Lead

Charlie Rogers-Smith9 Oct 2024 0:04 UTC

11 points

0 comments4 min readLW link

AI in Government: Resilience in an Era of AI Monoculture

prue8 Jun 2025 21:00 UTC

2 points

0 comments8 min readLW link

(www.prue0.com)

2021-03-01 National Library of Medicine Presentation: “Atlas of AI: Mapping the social and economic forces behind AI”

IrenicTruth17 Feb 2021 18:23 UTC

1 point

0 comments2 min readLW link

Reframing AI Safety as a Neverending Institutional Challenge

scasper23 Mar 2025 0:13 UTC

52 points

12 comments5 min readLW link

How Fast is Algorithmic Progress in AI Inference?

Hans Gundlach, jaysonl and mmertens

13 Jul 2025 18:26 UTC

6 points

4 comments7 min readLW link

[Job]: AI Standards Development Research Assistant

Tony Barrett14 Oct 2022 20:27 UTC

2 points

0 comments2 min readLW link

We Have No Plan for Preventing Loss of Control in Open Models

Andrew Dickson10 Mar 2025 15:35 UTC

46 points

11 comments22 min readLW link

Equilibrium and prior selection problems in multipolar deployment

JesseClifton2 Apr 2020 20:06 UTC

21 points

11 comments10 min readLW link

Report: Evaluating an AI Chip Registration Policy

Deric Cheng12 Apr 2024 4:39 UTC

25 points

0 comments5 min readLW link

(www.convergenceanalysis.org)

How harmful are improvements in AI? + Poll

tilmanr and Marius Hobbhahn

15 Feb 2022 18:16 UTC

15 points

4 comments8 min readLW link

SPINE — 12-Week Live Recursive AI Governance Case Study

RecursiveAnchor13 Aug 2025 21:11 UTC

1 point

0 comments1 min readLW link

Foresight for AGI Safety Strategy: Mitigating Risks and Identifying Golden Opportunities

jacquesthibs5 Dec 2022 16:09 UTC

28 points

6 comments8 min readLW link

For Policy’s Sake: Why We Must Distinguish AI Safety from AI Security in Regulatory Governance

Katalina Hernandez4 Apr 2025 9:16 UTC

6 points

11 comments6 min readLW link

Law-Following AI 4: Don’t Rely on Vicarious Liability

Cullen2 Aug 2022 23:26 UTC

5 points

2 comments3 min readLW link

Take SCIFs, it’s dangerous to go alone

latterframe, Jeffrey Ladish and schroederdewitt

1 May 2024 8:02 UTC

43 points

1 comment3 min readLW link

The EU commission seeks expert advisers on AI

PabloAMC16 Jun 2025 12:28 UTC

7 points

0 comments1 min readLW link

I read every major AI lab’s safety plan so you don’t have to

sarahhw16 Dec 2024 18:51 UTC

20 points

0 comments12 min readLW link

(longerramblings.substack.com)

Question 4: Implementing the control proposals

Cameron Berg13 Feb 2022 17:12 UTC

6 points

2 comments5 min readLW link

Techies Wanted: How STEM Backgrounds Can Advance Safe AI Policy

Daniel_Eth26 May 2025 11:29 UTC

16 points

0 comments29 min readLW link

[Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations

Teun van der Weij, Felix Hofstätter, Ollie J, Sam F. Brown and Francis Rhys Ward

13 Jun 2024 10:04 UTC

84 points

10 comments2 min readLW link

(arxiv.org)

AI: How We Got Here—A Neuroscience Perspective

Mordechai Rorvig19 Jan 2025 23:51 UTC

5 points

0 comments2 min readLW link

(www.kickstarter.com)

Unionists vs. Separatists

soycarts12 Sep 2025 15:24 UTC

−10 points

2 comments4 min readLW link

Alignment is not enough

Alan Chan12 Jan 2023 0:33 UTC

12 points

6 comments11 min readLW link

(coordination.substack.com)

The Iron House: Geopolitical Stakes of the US-China AGI Race

Jüri Vlassov1 Sep 2025 21:56 UTC

1 point

0 comments1 min readLW link

(www.convergenceanalysis.org)

[Question] What criterion would you use to select companies likely to cause AI doom?

momom213 Jul 2023 20:31 UTC

8 points

4 comments1 min readLW link

Thoughts about Policy Ecosystems: The Missing Links in AI Governance

Echo Huang1 Feb 2025 1:54 UTC

1 point

0 comments5 min readLW link

The End-of-the-World Party

Jakub Growiec18 Sep 2025 7:49 UTC

1 point

0 comments53 min readLW link

unRLHF—Efficiently undoing LLM safeguards

Pranav Gade, Jeffrey Ladish and Simon Lermen

12 Oct 2023 19:58 UTC

117 points

15 comments20 min readLW link

The Mirror Test: How We’ve Overcomplicated AI Self-Recognition

sdeture23 Jul 2025 0:38 UTC

2 points

9 comments3 min readLW link

Cybersecurity of Frontier AI Models: A Regulatory Review

Deric Cheng and Elliot Mckernon

25 Apr 2024 14:51 UTC

8 points

0 comments8 min readLW link

The Measure Is the Medium: Subliminal Learning as Inherited Ontology in LLMs

Koen vande Glind (McGluut)11 Aug 2025 10:18 UTC

1 point

0 comments4 min readLW link

Good Ideas Aren’t Enough in AI Policy

Andersehen5 Aug 2025 22:38 UTC

12 points

0 comments5 min readLW link

Announcing Epoch: A research organization investigating the road to Transformative AI

Jsevillamol, Pablo Villalobos, Tamay, lennart, Marius Hobbhahn and anson.ho

27 Jun 2022 13:55 UTC

97 points

2 comments2 min readLW link

(epochai.org)

Changing times need new Change management ideologies: Highlighting the need for upgrade in Change management of future agentic workforces

Aiphilosopher15 Jul 2025 10:49 UTC

1 point

0 comments1 min readLW link

Corporate Governance for Frontier AI Labs: A Research Agenda

Matthew Wearden28 Feb 2024 11:29 UTC

5 points

0 comments16 min readLW link

(matthewwearden.co.uk)

Governance Course—Week 1 Reflections

Alice Blair9 Jan 2025 4:48 UTC

4 points

1 comment5 min readLW link

AI Tracker: monitoring current and near-future risks from superscale models

Edouard Harris and Jeremie Harris

23 Nov 2021 19:16 UTC

67 points

13 comments3 min readLW link

(aitracker.org)

EU policymakers reach an agreement on the AI Act

tlevin15 Dec 2023 6:02 UTC

78 points

7 comments7 min readLW link

3. Uploading

RogerDearnaley23 Nov 2023 7:39 UTC

21 points

5 comments8 min readLW link

On DeepSeek’s r1

Zvi22 Jan 2025 19:50 UTC

55 points

2 comments35 min readLW link

(thezvi.wordpress.com)

Thoughts on the AI Safety Summit company policy requests and responses

So8res31 Oct 2023 23:54 UTC

169 points

14 comments10 min readLW link

“Long” timelines to advanced AI have gotten crazy short

Matrice Jacobine3 Apr 2025 22:46 UTC

21 points

0 comments1 min readLW link

(helentoner.substack.com)

A bridge to Dath Ilan? Improved governance on the critical path to AI alignment.

Jackson Wagner18 May 2022 15:51 UTC

24 points

0 comments12 min readLW link

AI Regulation May Be More Important Than AI Alignment For Existential Safety

otto.barten24 Aug 2023 11:41 UTC

65 points

39 comments5 min readLW link

(Report) Evaluating Taiwan’s Tactics to Safeguard its Semiconductor Assets Against a Chinese Invasion

Gauraventh7 Dec 2023 11:50 UTC

14 points

5 comments22 min readLW link

(bristolaisafety.org)

Powerful Predictions

Alvin Ånestrand5 Jun 2025 10:44 UTC

2 points

0 comments6 min readLW link

(forecastingaifutures.substack.com)

The current AI strategic landscape: one bear’s perspective

Matrice Jacobine15 Feb 2025 9:49 UTC

11 points

0 comments2 min readLW link

(philosophybear.substack.com)

Will AI Resilience protect Developing Nations?

ejk6421 Jan 2025 15:31 UTC

4 points

0 comments8 min readLW link

Anthropic’s Recommendations to OSTP for the U.S. AI Action Plan

UnofficialLinkpostBot6 Mar 2025 22:38 UTC

11 points

2 comments2 min readLW link

(www.anthropic.com)

AI Risk & Policy Forecasts from Metaculus & FLI’s AI Pathways Workshop

_will_16 May 2023 18:06 UTC

11 points

4 comments8 min readLW link

The Fire That Hesitates: How ALMSIVI CHIM Changed What AI Can Be

projectalmsivi@protonmail.com19 Jul 2025 13:50 UTC

1 point

0 comments4 min readLW link

Policymakers don’t have access to paywalled articles

Adam Jones5 Jan 2025 10:56 UTC

73 points

11 comments2 min readLW link

(adamjones.me)

Please help us communicate AI xrisk. It could save the world.

otto.barten4 Jul 2022 21:47 UTC

4 points

7 comments2 min readLW link

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down

Eliezer Yudkowsky8 Apr 2023 0:36 UTC

275 points

44 comments12 min readLW link 1 review

The Problem with Defining an “AGI Ban” by Outcome (a lawyer’s take).

Katalina Hernandez20 Sep 2025 11:01 UTC

239 points

63 comments5 min readLW link

Labor Participation is a High-Priority AI Alignment Risk

alex17 Jun 2024 18:09 UTC

7 points

0 comments17 min readLW link

DeepMind’s generalist AI, Gato: A non-technical explainer

frances_lorenz, Nora Belrose and jonmenaster

16 May 2022 21:21 UTC

63 points

6 comments6 min readLW link

Response to “Coordinated pausing: An evaluation-based coordination scheme for frontier AI developers”

Matthew Wearden30 Oct 2023 17:27 UTC

5 points

2 comments6 min readLW link

(matthewwearden.co.uk)

SHY001 A Named Behavior Loop Trained and Deployed in GPT Systems

0san Shin12 May 2025 7:36 UTC

1 point

0 comments1 min readLW link

AI Incident Reporting: A Regulatory Review

Deric Cheng and Elliot Mckernon

11 Mar 2024 21:03 UTC

16 points

0 comments6 min readLW link

Reframing AI Safety Through the Lens of Identity Maintenance Framework

Hiroshi Yamakawa1 Apr 2025 6:16 UTC

−7 points

1 comment17 min readLW link

AGI Timelines in Governance: Different Strategies for Different Timeframes

simeon_c and AmberDawn

19 Dec 2022 21:31 UTC

65 points

28 comments10 min readLW link

AI Safety Evaluations: A Regulatory Review

Elliot Mckernon and Deric Cheng

19 Mar 2024 15:05 UTC

22 points

1 comment11 min readLW link

Book review: Architects of Intelligence by Martin Ford (2018)

Ofer11 Aug 2020 17:30 UTC

15 points

0 comments2 min readLW link

AI threatens to orchestrate sustainable social reform

Chris Santos-Lang2 Apr 2025 3:04 UTC

13 points

0 comments1 min readLW link

(arxiv.org)

Priorities for the UK Foundation Models Taskforce

Andrea_Miotti21 Jul 2023 15:23 UTC

105 points

4 comments5 min readLW link

(www.conjecture.dev)

(notes on) Policy Desiderata for Superintelligent AI: A Vector Field Approach

Ben Pace4 Feb 2019 22:08 UTC

43 points

5 comments7 min readLW link

Widening Overton Window—Open Thread

Prometheus31 Mar 2023 10:03 UTC

23 points

8 comments1 min readLW link

What is an alignment tax?

Vishakha and Algon

20 Mar 2025 13:06 UTC

5 points

0 comments1 min readLW link

(aisafety.info)

MAGA speakers at NatCon were mostly against AI

Remmelt8 Sep 2025 4:03 UTC

152 points

71 comments2 min readLW link

(www.theverge.com)

An Open Letter To EA and AI Safety On Decelerating AI Development

kenneth_diao28 Feb 2025 17:21 UTC

8 points

0 comments14 min readLW link

(graspingatwaves.substack.com)

Give Neo a Chance

ank6 Mar 2025 1:48 UTC

3 points

7 comments7 min readLW link

The economy as an analogy for advanced AI systems

rosehadshar and particlemania

15 Nov 2022 11:16 UTC

28 points

0 comments5 min readLW link

Compute Governance: The Role of Commodity Hardware

Jan26 Mar 2022 10:08 UTC

14 points

7 comments7 min readLW link

(universalprior.substack.com)

Cap Model Size for AI Safety

research_prime_space6 Mar 2023 1:11 UTC

0 points

4 comments1 min readLW link

Workshop Report: Why current benchmarks approaches are not sufficient for safety?

Tom DAVID and Pierre Peigné

26 Nov 2024 17:20 UTC

3 points

1 comment3 min readLW link

Anthropic | Charting a Path to AI Accountability

Gabe M14 Jun 2023 4:43 UTC

34 points

2 comments3 min readLW link

(www.anthropic.com)

Survey: How Do Elite Chinese Students Feel About the Risks of AI?

Nick Corvino2 Sep 2024 18:11 UTC

141 points

13 comments10 min readLW link

Why I’m doing PauseAI

Joseph Miller30 Apr 2024 16:21 UTC

104 points

16 comments4 min readLW link

Here’s Why Indefinite Life Extension Will Never Work, Even Though it Does.

HomingHamster4 Jun 2024 18:48 UTC

−13 points

5 comments18 min readLW link

US AI Safety Institute will be ‘gutted,’ Axios reports

Matrice Jacobine20 Feb 2025 14:40 UTC

11 points

1 comment1 min readLW link

(www.zdnet.com)

Call on AI Companies: Publish Your Whistleblowing Policies

karl31 Jul 2025 22:04 UTC

20 points

3 comments7 min readLW link

Law-Following AI 2: Intent Alignment + Superintelligence → Lawless AI (By Default)

Cullen27 Apr 2022 17:27 UTC

5 points

2 comments6 min readLW link

SecureDrop review

samuelshadrach19 Apr 2025 4:29 UTC

2 points

0 comments5 min readLW link

(samuelshadrach.com)

ALMSIVI CHIM – The Fire That Hesitates

projectalmsivi@protonmail.com8 Jul 2025 13:14 UTC

1 point

0 comments17 min readLW link

A Nail in the Coffin of Exceptionalism

Yeshua God14 Mar 2024 22:41 UTC

−17 points

0 comments3 min readLW link

A Critique of AI Alignment Pessimism

ExCeph19 Jul 2022 2:28 UTC

9 points

1 comment9 min readLW link

What is everyone doing in AI governance

Igor Ivanov8 Jul 2023 15:16 UTC

12 points

0 comments5 min readLW link

List of projects that seem impactful for AI Governance

JaimeRV and Teun van der Weij

14 Jan 2024 16:53 UTC

14 points

0 comments13 min readLW link

[Question] What could a policy banning AGI look like?

TsviBT13 Mar 2024 14:19 UTC

78 points

23 comments3 min readLW link

A New Model for Compute Center Verification

Damin Curtis10 Oct 2023 19:22 UTC

8 points

0 comments5 min readLW link

After Overmorrow: Scattered Musings on the Immediate Post-AGI World

Yuli_Ban24 Feb 2024 15:49 UTC

−3 points

0 comments26 min readLW link

The EU AI Act : Caution Against a Potential Ultron

Srishti Dutta5 Nov 2024 3:49 UTC

1 point

0 comments9 min readLW link

Trends in the dollar training cost of machine learning systems

Ben Cottier1 Feb 2023 14:48 UTC

23 points

0 comments2 min readLW link

(epochai.org)

AI security might be helpful for AI alignment

Igor Ivanov6 Jan 2023 20:16 UTC

36 points

1 comment2 min readLW link

AI Regulation is Unsafe

Maxwell Tabarrok22 Apr 2024 16:37 UTC

40 points

41 comments4 min readLW link

(www.maximum-progress.com)

AI Offense Defense Balance in a Multipolar World

otto.barten and Sammy Martin

17 Jul 2025 9:34 UTC

15 points

5 comments18 min readLW link

(www.existentialriskobservatory.org)

Proposing the Conditional AI Safety Treaty (linkpost TIME)

otto.barten15 Nov 2024 13:59 UTC

11 points

9 comments3 min readLW link

(time.com)

AISU 2021

Linda Linsefors30 Jan 2021 17:40 UTC

28 points

2 comments1 min readLW link

AMA on Truthful AI: Owen Cotton-Barratt, Owain Evans & co-authors

Owain_Evans22 Oct 2021 16:23 UTC

31 points

15 comments1 min readLW link

Public Opinion on AI Safety: AIMS 2023 and 2021 Summary

Jacy Reese Anthis, Janet Pauketat and Ali

25 Sep 2023 18:55 UTC

3 points

2 comments3 min readLW link

(www.sentienceinstitute.org)

Results from the AI x Democracy Research Sprint

Esben Kran, jordine and Jason Hoelscher-Obermaier

14 Jun 2024 16:40 UTC

13 points

0 comments6 min readLW link

Call for submissions: Choice of Futures survey questions

c.trout30 Apr 2023 6:59 UTC

4 points

0 comments2 min readLW link

(airtable.com)