habryka comments on Responsible Scaling Policy v3

habryka 26 Feb 2026 19:19 UTC
4 points
Yes, inasmuch as the Anthropic RSP was intended as an implementation of if-then commitments with specific ifs and thens, that would be inconsistent. But IIRC Holden didn’t work at Anthropic when the RSP was developed or adopted, and I haven’t seen any writing by Holden about the degree to which he considers Anthropic committed to these thresholds, or sees the RSP as a clear instance of something that follows the shape of what is in that paper.
Here I was mostly referencing specific conversations or interactions (e.g. in comment threads and emails) that I had with Holden, as opposed to others at Anthropic, about the RSP.