
yix

Karma: 220

https://yixiong.dev/

Teaching Models to Dream of Better Monitors through Monitor Sensitive Training

19 Mar 2026 21:01 UTC
32 points
0 comments · 10 min read · LW link

We need a better way to evaluate emergent misalignment

11 Jan 2026 16:21 UTC
86 points
9 comments · 6 min read · LW link

yix’s Shortform

yix · 6 Dec 2025 2:27 UTC
2 points
2 comments · 1 min read · LW link

TastyBench: Toward Measuring Research Taste in LLM

2 Dec 2025 23:26 UTC
31 points
2 comments · 6 min read · LW link

Lessons from a year of university AI safety field building

6 Jun 2025 14:35 UTC
34 points
3 comments · 7 min read · LW link

College technical AI safety hackathon retrospective - Georgia Tech

yix · 15 Nov 2024 0:22 UTC
44 points
2 comments · 5 min read · LW link
(open.substack.com)