Julian Stastny

Karma: 540

member of technical staff @ redwood research

Prospects for studying actual schemers

19 Sep 2025 14:11 UTC
30 points
0 comments, 58 min read

Research Areas in AI Control (The Alignment Project by UK AISI)

1 Aug 2025 10:27 UTC
25 points
0 comments, 18 min read
(alignmentproject.aisi.gov.uk)

Recent Redwood Research project proposals

14 Jul 2025 22:27 UTC
91 points
0 comments, 3 min read

Linkpost: Redwood Research reading list

10 Jul 2025 18:39 UTC
50 points
0 comments, 1 min read
(redwoodresearch.substack.com)

What’s worse, spies or schemers?

9 Jul 2025 14:37 UTC
51 points
2 comments, 5 min read

Two proposed projects on abstract analogies for scheming

4 Jul 2025 16:03 UTC
47 points
0 comments, 3 min read

Making deals with early schemers

20 Jun 2025 18:21 UTC
121 points
41 comments, 15 min read

Misalignment and Strategic Underperformance: An Analysis of Sandbagging and Exploration Hacking

8 May 2025 19:06 UTC
77 points
3 comments, 15 min read

7+ tractable directions in AI control

28 Apr 2025 17:12 UTC
93 points
1 comment, 13 min read

Disentangling four motivations for acting in accordance with UDT

5 Nov 2023 21:26 UTC
35 points
4 comments, 7 min read