RSS

Jiaxin Wen

Karma: 98

https://​​jiaxin-wen.github.io/​​

Self-Recog­ni­tion Fine­tun­ing can Re­v­erse and Prevent Emer­gent Misalignment

15 Mar 2026 0:11 UTC
40 points
7 comments7 min readLW link

Un­su­per­vised Elic­i­ta­tion of Lan­guage Models

13 Jun 2025 16:15 UTC
57 points
12 comments2 min readLW link