RSS

hadad

Karma: 18

At­tack Selec­tion In Agen­tic AI Con­trol Evals Can De­crease Safety

14 Apr 2026 18:02 UTC
22 points
3 comments18 min readLW link