Tyler Tracy

Karma: 521

Redwood Research. Paperclip miminzer

Should we combine protocols for AI Control Research?

Ram Potham and Tyler Tracy

26 Jun 2026 19:02 UTC

9 points

0 comments9 min readLW link

A Pipeline for Generating Synthetic Sabotage Trajectories to Red-Team Monitors

Myles H and Tyler Tracy

3 Jun 2026 20:33 UTC

9 points

0 comments12 min readLW link

Introducing LinuxArena

Tyler Tracy, Ram Potham, Nick Kuhn and Myles H

20 Apr 2026 22:00 UTC

80 points

2 comments4 min readLW link

Refactor Arena: A Control Setting for Software Engineering

fastfedora and Tyler Tracy

18 Apr 2026 2:57 UTC

18 points

2 comments25 min readLW link

Attack Selection In Agentic AI Control Evals Can Decrease Safety

Cath Ge-Wang, Tyler Crosse, hadad, Ram Potham and Tyler Tracy

14 Apr 2026 18:02 UTC

26 points

3 comments18 min readLW link

Chat, is this sus?

Tyler Tracy1 Apr 2026 17:32 UTC

54 points

3 comments1 min readLW link

The sum of its parts: composing AI control protocols

ZachParent, Lennart Finke and Tyler Tracy

15 Oct 2025 1:11 UTC

14 points

1 comment11 min readLW link

Optimally Combining Probe Monitors and Black Box Monitors

Tim Hua, James Baskerville, BionicD0LPH1N, Mia Hopman, Aryan Bhatt and Tyler Tracy

27 Jul 2025 19:13 UTC

53 points

2 comments6 min readLW link

Recent Redwood Research project proposals

ryan_greenblatt, Buck, Julian Stastny, joshc, Alex Mallen, Adam Kaufman , Tyler Tracy, Aryan Bhatt and Joey Yudelson

14 Jul 2025 22:27 UTC

99 points

0 comments3 min readLW link

Untrusted AIs can exploit feedback in control protocols

Mia Hopman, BionicD0LPH1N and Tyler Tracy

27 May 2025 16:41 UTC

30 points

0 comments16 min readLW link

Ctrl-Z: Controlling AI Agents via Resampling

Aryan Bhatt, Buck, Adam Kaufman and Tyler Tracy

16 Apr 2025 16:21 UTC

128 points

0 comments20 min readLW link

When does external behaviour imply interal structure?

Tyler Tracy31 May 2024 16:41 UTC

6 points

5 comments7 min readLW link