RSS

Alice Blair

Karma: 1,088

Dumping out a lot of thoughts on LW in hopes that something sticks. Eternally upskilling.

I write the ML Safety Newsletter

DMs open, especially for promising opportunities in AI Safety and potential collaborators. I’m maybe interested in helping you optimize the communications of your new project.

MLSN #18: Ad­ver­sar­ial Diffu­sion, Ac­ti­va­tion Or­a­cles, Weird Generalization

20 Jan 2026 17:03 UTC
14 points
3 comments5 min readLW link

The Weak­est Model in the Selector

Alice Blair29 Dec 2025 6:55 UTC
13 points
4 comments1 min readLW link

In Fa­vor of Inkhaven-But-Less

Alice Blair13 Dec 2025 23:16 UTC
26 points
6 comments2 min readLW link

Rea­sons to care about Ca­nary Strings

Alice Blair5 Dec 2025 21:41 UTC
27 points
3 comments2 min readLW link

Slack Observability

Alice Blair1 Dec 2025 7:52 UTC
32 points
0 comments2 min readLW link