RSS

Tyler Tracy

Karma: 500

Redwood Research. Paperclip miminzer

In­tro­duc­ing LinuxArena

20 Apr 2026 22:00 UTC
80 points
2 comments4 min readLW link

Re­fac­tor Arena: A Con­trol Set­ting for Soft­ware Engineering

18 Apr 2026 2:57 UTC
17 points
2 comments25 min readLW link

At­tack Selec­tion In Agen­tic AI Con­trol Evals Can De­crease Safety

14 Apr 2026 18:02 UTC
24 points
3 comments18 min readLW link

Chat, is this sus?

Tyler Tracy1 Apr 2026 17:32 UTC
54 points
3 comments1 min readLW link

The sum of its parts: com­pos­ing AI con­trol protocols

15 Oct 2025 1:11 UTC
14 points
1 comment11 min readLW link

Op­ti­mally Com­bin­ing Probe Mon­i­tors and Black Box Monitors

27 Jul 2025 19:13 UTC
52 points
2 comments6 min readLW link

Re­cent Red­wood Re­search pro­ject proposals

14 Jul 2025 22:27 UTC
93 points
0 comments3 min readLW link

Un­trusted AIs can ex­ploit feed­back in con­trol protocols

27 May 2025 16:41 UTC
30 points
0 comments16 min readLW link

Ctrl-Z: Con­trol­ling AI Agents via Resampling

16 Apr 2025 16:21 UTC
127 points
0 comments20 min readLW link