RSS

Tom Tseng

Karma: 139

Lay­ered AI Defenses Have Holes: Vuln­er­a­bil­ities and Key Recommendations

4 Jul 2025 0:07 UTC
13 points
1 comment4 min readLW link
(far.ai)

Does ro­bust­ness im­prove with scale?

25 Jul 2024 20:55 UTC
14 points
0 comments1 min readLW link
(far.ai)

Even Su­per­hu­man Go AIs Have Sur­pris­ing Failure Modes

20 Jul 2023 17:31 UTC
130 points
22 comments10 min readLW link
(far.ai)