RSS

Tyler Tracy

Karma: 306

Redwood Research. Paperclip miminzer

Op­ti­mally Com­bin­ing Probe Mon­i­tors and Black Box Monitors

27 Jul 2025 19:13 UTC
40 points
2 comments6 min readLW link

Re­cent Red­wood Re­search pro­ject proposals

14 Jul 2025 22:27 UTC
91 points
0 comments3 min readLW link

Un­trusted AIs can ex­ploit feed­back in con­trol protocols

27 May 2025 16:41 UTC
30 points
0 comments16 min readLW link

Ctrl-Z: Con­trol­ling AI Agents via Resampling

16 Apr 2025 16:21 UTC
124 points
0 comments20 min readLW link

When does ex­ter­nal be­havi­our im­ply in­teral struc­ture?

Tyler Tracy31 May 2024 16:41 UTC
6 points
5 comments7 min readLW link