RSS

Mia Taylor

Karma: 223

A draft hon­esty policy for cred­ible com­mu­ni­ca­tion with AI systems

6 May 2026 18:50 UTC
3 points
0 comments13 min readLW link
(www.forethought.org)

The value of moral diversity

Mia Taylor14 Apr 2026 19:07 UTC
23 points
0 comments10 min readLW link
(newsletter.forethought.org)

Mo­ral pub­lic goods are a big deal for whether we get a good future

24 Feb 2026 14:14 UTC
12 points
0 comments18 min readLW link
(www.forethought.org)

Are Short AI Timelines Really Higher-Lev­er­age?

23 Jan 2026 7:28 UTC
25 points
1 comment15 min readLW link
(www.forethought.org)

Blog post: how im­por­tant is the model spec if al­ign­ment fails?

Mia Taylor3 Dec 2025 20:19 UTC
11 points
1 comment1 min readLW link
(newsletter.forethought.org)

Harm­less re­ward hacks can gen­er­al­ize to mis­al­ign­ment in LLMs

26 Aug 2025 17:32 UTC
52 points
7 comments7 min readLW link

Model Or­ganisms for Emer­gent Misalignment

16 Jun 2025 15:46 UTC
119 points
19 comments5 min readLW link