RSS

Benjamin Hilton

Karma: 378

Head of Alignment at UK AI Security Institute (AISI). Previously 80,000 Hours, HM Treasury, Cabinet Office, Department for International Trade, Imperial College London.

An al­ign­ment safety case sketch based on debate

May 8, 2025, 3:02 PM
57 points
19 comments25 min readLW link
(arxiv.org)

UK AISI’s Align­ment Team: Re­search Agenda

May 7, 2025, 4:33 PM
111 points
2 comments11 min readLW link

A sketch of an AI con­trol safety case

Jan 30, 2025, 5:28 PM
57 points
0 comments5 min readLW link

Au­toma­tion collapse

Oct 21, 2024, 2:50 PM
72 points
9 comments7 min readLW link

Should you work at a lead­ing AI lab? (in­clud­ing in non-safety roles)

Benjamin HiltonJul 25, 2023, 4:29 PM
7 points
0 comments12 min readLW link

AI safety tech­ni­cal re­search—Ca­reer review

Benjamin HiltonJul 17, 2023, 3:34 PM
14 points
0 commentsLW link

How many peo­ple are work­ing (di­rectly) on re­duc­ing ex­is­ten­tial risk from AI?

Benjamin HiltonJan 18, 2023, 8:46 AM
20 points
1 commentLW link

Anony­mous ad­vice: If you want to re­duce AI risk, should you take roles that ad­vance AI ca­pa­bil­ities?

Benjamin HiltonOct 11, 2022, 2:16 PM
54 points
9 commentsLW link

New 80,000 Hours prob­lem pro­file on ex­is­ten­tial risks from AI

Benjamin HiltonAug 31, 2022, 5:36 PM
28 points
6 comments7 min readLW link
(80000hours.org)