
Alice Blair

Karma: 1,102

Dumping out a lot of thoughts on LW in hopes that something sticks. Eternally upskilling.

I write the ML Safety Newsletter.

DMs open, especially for promising opportunities in AI Safety and potential collaborators. I’m maybe interested in helping you optimize the communications of your new project.

AI Safety Newsletter #70: Automated Warfare and AI Layoffs

24 Mar 2026 15:30 UTC
8 points
0 comments
4 min read
LW link
(newsletter.safe.ai)

AI Safety Newsletter #69: Department of War, Anthropic, and National Security

13 Mar 2026 16:05 UTC
10 points
0 comments
4 min read
LW link
(newsletter.safe.ai)

MLSN #19: Honesty, Disempowerment, & Cybersecurity

Alice Blair
12 Mar 2026 15:42 UTC
6 points
0 comments
5 min read
LW link
(newsletter.mlsafety.org)

MLSN #18: Adversarial Diffusion, Activation Oracles, Weird Generalization

20 Jan 2026 17:03 UTC
14 points
3 comments
5 min read
LW link

The Weakest Model in the Selector

Alice Blair
29 Dec 2025 6:55 UTC
13 points
6 comments
1 min read
LW link

In Favor of Inkhaven-But-Less

Alice Blair
13 Dec 2025 23:16 UTC
26 points
6 comments
2 min read
LW link

Reasons to care about Canary Strings

Alice Blair
5 Dec 2025 21:41 UTC
27 points
3 comments
2 min read
LW link