ryan_greenblatt

Karma: 19,553

I’m the chief scientist at Redwood Research.

AIs will greatly change engineering in AI companies well before AGI

ryan_greenblatt

9 Sep 2025 16:58 UTC

45 points

9 comments, 11 min read, LW link

Trust me bro, just one more RL scale up, this one will be the real scale up with the good environments, the actually legit one, trust me bro

ryan_greenblatt

3 Sep 2025 13:21 UTC

151 points

30 comments, 8 min read, LW link

Attaching requirements to model releases has serious downsides (relative to a different deadline for these requirements)

ryan_greenblatt

27 Aug 2025 17:04 UTC

98 points

2 comments, 3 min read, LW link

My AGI timeline updates from GPT-5 (and 2025 so far)

ryan_greenblatt

20 Aug 2025 16:11 UTC

162 points

14 comments, 4 min read, LW link

Recent Redwood Research project proposals

ryan_greenblatt, Buck, Julian Stastny, joshc, Alex Mallen, Adam Kaufman, Tyler Tracy, Aryan Bhatt and Joey Yudelson

14 Jul 2025 22:27 UTC

91 points

0 comments, 3 min read, LW link

Jankily controlling superintelligence

ryan_greenblatt

27 Jun 2025 14:05 UTC

69 points

4 comments, 7 min read, LW link

What does 10x-ing effective compute get you?

ryan_greenblatt

24 Jun 2025 18:33 UTC

55 points

10 comments, 12 min read, LW link

Prefix cache untrusted monitors: a method to apply after you catch your AI

ryan_greenblatt

20 Jun 2025 15:56 UTC

32 points

1 comment, 7 min read, LW link

AI safety techniques leveraging distillation

ryan_greenblatt

19 Jun 2025 14:31 UTC

61 points

0 comments, 12 min read, LW link

When does training a model change its goals?

Vivek Hebbar and ryan_greenblatt

12 Jun 2025 18:43 UTC

71 points

2 comments, 15 min read, LW link

OpenAI now has an RL API which is broadly accessible

ryan_greenblatt

11 Jun 2025 23:39 UTC

43 points

1 comment, 5 min read, LW link

When is it important that open-weight models aren’t released? My thoughts on the benefits and dangers of open-weight models in response to developments in CBRN capabilities.

ryan_greenblatt

9 Jun 2025 19:19 UTC

63 points

11 comments, 9 min read, LW link

The best approaches for mitigating “the intelligence curse” (or gradual disempowerment); my quick guesses at the best object-level interventions

ryan_greenblatt

31 May 2025 18:20 UTC

71 points

19 comments, 5 min read, LW link

AIs at the current capability level may be important for future safety work

ryan_greenblatt

12 May 2025 14:06 UTC

82 points

2 comments, 4 min read, LW link

Slow corporations as an intuition pump for AI R&D automation

ryan_greenblatt and elifland

9 May 2025 14:49 UTC

91 points

23 comments, 9 min read, LW link

What’s going on with AI progress and trends? (As of 5/2025)

ryan_greenblatt

2 May 2025 19:00 UTC

75 points

8 comments, 8 min read, LW link

7+ tractable directions in AI control

Julian Stastny and ryan_greenblatt

28 Apr 2025 17:12 UTC

93 points

1 comment, 13 min read, LW link

To be legible, evidence of misalignment probably has to be behavioral

ryan_greenblatt

15 Apr 2025 18:14 UTC

57 points

19 comments, 3 min read, LW link

Why do misalignment risks increase as AIs get more capable?

ryan_greenblatt

11 Apr 2025 3:06 UTC

33 points

6 comments, 3 min read, LW link

An overview of areas of control work

ryan_greenblatt

25 Mar 2025 22:02 UTC

32 points

0 comments, 28 min read, LW link