RSS

Mia Taylor

Karma: 192

Are Short AI Timelines Really Higher-Lev­er­age?

23 Jan 2026 7:28 UTC
23 points
1 comment15 min readLW link
(www.forethought.org)

Blog post: how im­por­tant is the model spec if al­ign­ment fails?

Mia Taylor3 Dec 2025 20:19 UTC
11 points
1 comment1 min readLW link
(newsletter.forethought.org)

Harm­less re­ward hacks can gen­er­al­ize to mis­al­ign­ment in LLMs

26 Aug 2025 17:32 UTC
52 points
7 comments7 min readLW link

Model Or­ganisms for Emer­gent Misalignment

16 Jun 2025 15:46 UTC
118 points
19 comments5 min readLW link