Responsible Scaling Policies

TagLast edit: 27 Oct 2023 19:43 UTC by elifland

As proposed by ARC Evals, and with a version implemented by Anthropic

ARC Evals: Responsible Scaling Policies

Zach Stein-Perlman28 Sep 2023 4:30 UTC

40 points

10 comments2 min readLW link 1 review

(evals.alignment.org)

On ‘Responsible Scaling Policies’ (RSPs)

Zvi5 Dec 2023 16:10 UTC

49 points

3 comments37 min readLW link

(thezvi.wordpress.com)

Anthropic: Reflections on our Responsible Scaling Policy

Zac Hatfield-Dodds20 May 2024 4:14 UTC

30 points

21 comments10 min readLW link

(www.anthropic.com)

RSPs are pauses done right

evhub14 Oct 2023 4:06 UTC

166 points

79 comments7 min readLW link 1 review

Responsible Scaling Policies Are Risk Management Done Wrong

simeon_c25 Oct 2023 23:46 UTC

123 points

35 comments22 min readLW link 1 review

(www.navigatingrisks.ai)

Vaniver’s thoughts on Anthropic’s RSP

Vaniver28 Oct 2023 21:06 UTC

46 points

4 comments3 min readLW link

What’s up with “Responsible Scaling Policies”?

habryka and ryan_greenblatt

29 Oct 2023 4:17 UTC

100 points

9 comments20 min readLW link 1 review

AI #35: Responsible Scaling Policies

Zvi26 Oct 2023 13:30 UTC

66 points

10 comments55 min readLW link

(thezvi.wordpress.com)

Thoughts on responsible scaling policies and regulation

paulfchristiano24 Oct 2023 22:21 UTC

220 points

34 comments6 min readLW link

We’re Not Ready: thoughts on “pausing” and responsible scaling policies

HoldenKarnofsky27 Oct 2023 15:19 UTC

200 points

33 comments8 min readLW link

OMMC Announces RIP

Adam Scholl and aysja

1 Apr 2024 23:20 UTC

193 points

6 comments2 min readLW link 1 review

Anthropic rewrote its RSP

Zach Stein-Perlman15 Oct 2024 14:25 UTC

46 points

19 comments6 min readLW link

Anthropic’s updated Responsible Scaling Policy

Zac Hatfield-Dodds15 Oct 2024 16:46 UTC

38 points

3 comments3 min readLW link

(www.anthropic.com)

Anthropic’s Responsible Scaling Policy & Long-Term Benefit Trust

Zac Hatfield-Dodds19 Sep 2023 15:09 UTC

85 points

26 comments3 min readLW link 1 review

(www.anthropic.com)

OpenAI’s Preparedness Framework: Praise & Recommendations

Orpheus162 Jan 2024 16:20 UTC

66 points

1 comment7 min readLW link

On OpenAI’s Preparedness Framework

Zvi21 Dec 2023 14:00 UTC

51 points

4 comments21 min readLW link

(thezvi.wordpress.com)

Meta: Frontier AI Framework

Zach Stein-Perlman3 Feb 2025 22:00 UTC

33 points

2 comments1 min readLW link

(ai.meta.com)

Paul Christiano on Dwarkesh Podcast

ESRogs3 Nov 2023 22:13 UTC

19 points

0 comments1 min readLW link

(www.dwarkeshpatel.com)

Anthropic & Dario’s dream

Simon Lermen8 Nov 2025 1:19 UTC

55 points

1 comment5 min readLW link

OpenAI: Preparedness framework

Zach Stein-Perlman18 Dec 2023 18:30 UTC

70 points

23 comments4 min readLW link

(openai.com)

Dario Amodei’s prepared remarks from the UK AI Safety Summit, on Anthropic’s Responsible Scaling Policy

Zac Hatfield-Dodds1 Nov 2023 18:10 UTC

85 points

1 comment4 min readLW link

(www.anthropic.com)

Emergency Response Measures for Catastrophic AI Risk

MKodama23 Jan 2026 18:18 UTC

27 points

2 comments3 min readLW link

The Operational Security Failure in Anthropic’s RSP v3

Ugurcan Arikan10 Mar 2026 17:08 UTC

2 points

0 comments4 min readLW link

Anthropic—The case for targeted regulation

anaguma5 Nov 2024 7:07 UTC

11 points

0 comments2 min readLW link

(www.anthropic.com)

How are voluntary commitments on vulnerability reporting going?

Adam Jones22 Feb 2024 8:43 UTC

23 points

1 comment1 min readLW link

(adamjones.me)

OpenAI Preparedness Framework 2.0

Zvi2 May 2025 13:10 UTC

61 points

1 comment23 min readLW link

(thezvi.wordpress.com)

A call for a quantitative report card for AI bioterrorism threat models

Juno4 Dec 2023 6:35 UTC

12 points

0 comments10 min readLW link

No comments.

Re­spon­si­ble Scal­ing Policies

Responsible Scaling Policies