Oliver Sourbut

Karma: 1,846

oliversourbut.net

Autonomous Systems @ UK AI Safety Institute (AISI)
DPhil AI Safety @ Oxford (Hertford college, CS dept, AIMS CDT)
Former senior data scientist and software engineer + SERI MATS

I’m particularly interested in sustainable collaboration and the long-term future of value. I’d love to contribute to a safer and more prosperous future with AI! Always interested in discussions about axiology, x-risks, s-risks.

I enjoy meeting new perspectives and growing my understanding of the world and the people in it. I also love to read—let me know your suggestions! In no particular order, here are some I’ve enjoyed recently

Ord—The Precipice
Pearl—The Book of Why
Bostrom—Superintelligence
McCall Smith—The No. 1 Ladies’ Detective Agency (and series)
Melville—Moby-Dick
Abelson & Sussman—Structure and Interpretation of Computer Programs
Stross—Accelerando
Graeme—The Rosie Project (and trilogy)

Cooperative gaming is a relatively recent but fruitful interest for me. Here are some of my favourites

Hanabi (can’t recommend enough; try it out!)
Pandemic (ironic at time of writing...)
Dungeons and Dragons (I DM a bit and it keeps me on my creative toes)
Overcooked (my partner and I enjoy the foody themes and frantic realtime coordination playing this)

People who’ve got to know me only recently are sometimes surprised to learn that I’m a pretty handy trumpeter and hornist.

How did ‘large’ language models get that way? The role of Transformers and Pretraining in GPT

Oliver Sourbut3 May 2026 21:35 UTC

16 points

0 comments7 min readLW link

(www.oliversourbut.net)

Is the Cat Out of the Bag?: Who knows how to make AGI?

Oliver Sourbut24 Apr 2026 21:49 UTC

33 points

0 comments4 min readLW link

(www.oliversourbut.net)

“Best humans still outperform”: One turning point in the history of cope around artificial intelligence

Oliver Sourbut17 Apr 2026 14:10 UTC

28 points

6 comments3 min readLW link

(www.oliversourbut.net)

Defense-favoured coordination design sketches

owencb, Oliver Sourbut, Lizka and rosehadshar

6 Apr 2026 15:19 UTC

18 points

6 comments25 min readLW link

(www.forethought.org)

Orders of magnitude: use semitones, not decibels

Oliver Sourbut1 Apr 2026 22:41 UTC

56 points

4 comments4 min readLW link

(www.oliversourbut.net)

Strategic awareness tools: design sketches

rosehadshar, owencb, Lizka and Oliver Sourbut

11 Feb 2026 12:28 UTC

18 points

2 comments1 min readLW link

(www.forethought.org)

Design sketches for a more sensible world

owencb, Lizka, Oliver Sourbut and rosehadshar

9 Feb 2026 10:22 UTC

26 points

2 comments4 min readLW link

(www.forethought.org)

Design sketches for angels-on-the-shoulder

owencb, Lizka, Oliver Sourbut and rosehadshar

9 Feb 2026 9:52 UTC

23 points

0 comments2 min readLW link

(www.forethought.org)

AI for Human Reasoning for Rationalists

Oliver Sourbut3 Feb 2026 13:22 UTC

29 points

0 comments4 min readLW link

(www.oliversourbut.net)

[Question] Another Cost Disease? We are all capitalists now

Oliver Sourbut9 Jan 2026 13:07 UTC

16 points

11 comments2 min readLW link

A Full Epistemic Stack: Knowledge Commons for the 21st Century

Oliver Sourbut and Ben Goldhaber

19 Dec 2025 22:48 UTC

44 points

7 comments11 min readLW link

(www.oliversourbut.net)

Better than logarithmic returns to reasoning?

Oliver Sourbut30 Jul 2025 0:50 UTC

14 points

5 comments3 min readLW link

(www.oliversourbut.net)

Do LLMs know what they’re capable of? Why this matters for AI safety, and initial findings

Casey Barkan, Sid Black and Oliver Sourbut

13 Jul 2025 19:54 UTC

53 points

5 comments18 min readLW link

You Can’t Skip Exploration: Why understanding experimentation and taste is key to understanding AI

Oliver Sourbut21 May 2025 16:08 UTC

20 points

0 comments11 min readLW link

(www.oliversourbut.net)

FLF Fellowship on AI for Human Reasoning: $25-50k, 12 weeks

Oliver Sourbut and Ben Goldhaber

19 May 2025 13:25 UTC

76 points

1 comment2 min readLW link

(www.flf.org)

Deceptive Alignment and Homuncularity

Oliver Sourbut and TurnTrout

16 Jan 2025 13:55 UTC

26 points

12 comments22 min readLW link

Cooperation and Alignment in Delegation Games: You Need Both!

Oliver Sourbut, Lewis Hammond and HarrietW

3 Aug 2024 10:16 UTC

9 points

0 comments14 min readLW link

(www.oliversourbut.net)

[Question] Terminology: <something>-ware for ML?

Oliver Sourbut3 Jan 2024 11:42 UTC

17 points

27 comments1 min readLW link

Alignment, conflict, powerseeking

Oliver Sourbut22 Nov 2023 9:47 UTC

7 points

1 comment1 min readLW link

Careless talk on US-China AI competition? (and criticism of CAIS coverage)

Oliver Sourbut20 Sep 2023 12:46 UTC

18 points

3 comments10 min readLW link 3 reviews

(www.oliversourbut.net)