RSS

Yawen Duan

Karma: 154

Illu­sory Safety: Redteam­ing Deep­Seek R1 and the Strongest Fine-Tun­able Models of OpenAI, An­thropic, and Google

Feb 7, 2025, 3:57 AM
37 points

12 votes

Overall karma indicates overall quality.

0 comments10 min readLW link

Even Su­per­hu­man Go AIs Have Sur­pris­ing Failure Modes

Jul 20, 2023, 5:31 PM
130 points

59 votes

Overall karma indicates overall quality.

22 comments10 min readLW link
(far.ai)