Alice Blair

Karma: 1,032

Dumping out a lot of thoughts on LW in hopes that something sticks. Eternally upskilling.

I write the ML Safety Newsletter

DMs open, especially for promising opportunities in AI Safety and potential collaborators. I’m maybe interested in helping you optimize the communications of your new project.

In Favor of Inkhaven-But-Less

Alice Blair · 13 Dec 2025 23:16 UTC
26 points
5 comments · 2 min read · LW link

Reasons to care about Canary Strings

Alice Blair · 5 Dec 2025 21:41 UTC
25 points
3 comments · 2 min read · LW link

Slack Observability

Alice Blair · 1 Dec 2025 7:52 UTC
30 points
0 comments · 2 min read · LW link

Gemini 3 is Evaluation-Paranoid and Contaminated

Alice Blair · 20 Nov 2025 21:02 UTC
168 points
40 comments · 7 min read · LW link

MLSN #17: Measuring General AI Abilities and Mitigating Deception

19 Nov 2025 20:11 UTC
5 points
0 comments · 6 min read · LW link
(newsletter.mlsafety.org)

In-Context Writing with Sonnet 4.5

Alice Blair · 17 Nov 2025 7:51 UTC
9 points
0 comments · 3 min read · LW link