RSS

Jacob Pfau

Karma: 1,577

UK AISI Alignment Team

Se­quent: scale and au­toma­tion for higher con­fi­dence in alignment

10 Jun 2026 15:37 UTC
276 points
2 comments11 min readLW link
(sequent.org)

Au­to­mated Align­ment is Harder Than You Think

14 May 2026 22:01 UTC
143 points
7 comments3 min readLW link
(arxiv.org)

From per­sonas to in­ten­tions: to­wards a sci­ence of mo­ti­va­tions for AI models

14 Apr 2026 12:26 UTC
77 points
5 comments7 min readLW link