Mia Hopman

Karma: 59

Optimally Combining Probe Monitors and Black Box Monitors

27 Jul 2025 19:13 UTC
37 points
2 comments, 6 min read

Untrusted AIs can exploit feedback in control protocols

27 May 2025 16:41 UTC
30 points
0 comments, 16 min read