RSS

Oliver Sourbut

Karma: 1,592

oliversourbut.net

I’m particularly interested in sustainable collaboration and the long-term future of value. I’d love to contribute to a safer and more prosperous future with AI! Always interested in discussions about axiology, x-risks, s-risks.

I enjoy meeting new perspectives and growing my understanding of the world and the people in it. I also love to read—let me know your suggestions! In no particular order, here are some I’ve enjoyed recently

Cooperative gaming is a relatively recent but fruitful interest for me. Here are some of my favourites

People who’ve got to know me only recently are sometimes surprised to learn that I’m a pretty handy trumpeter and hornist.

Strate­gic aware­ness tools: de­sign sketches

11 Feb 2026 12:28 UTC
18 points
0 comments1 min readLW link
(www.forethought.org)

De­sign sketches for a more sen­si­ble world

9 Feb 2026 10:22 UTC
25 points
2 comments4 min readLW link
(www.forethought.org)

De­sign sketches for an­gels-on-the-shoulder

9 Feb 2026 9:52 UTC
23 points
0 comments2 min readLW link
(www.forethought.org)

AI for Hu­man Rea­son­ing for Rationalists

Oliver Sourbut3 Feb 2026 13:22 UTC
29 points
0 comments4 min readLW link

[Question] Another Cost Disease? We are all cap­i­tal­ists now

Oliver Sourbut9 Jan 2026 13:07 UTC
16 points
11 comments2 min readLW link

A Full Epistemic Stack: Knowl­edge Com­mons for the 21st Century

19 Dec 2025 22:48 UTC
44 points
7 comments11 min readLW link
(www.oliversourbut.net)

Bet­ter than log­a­r­ith­mic re­turns to rea­son­ing?

Oliver Sourbut30 Jul 2025 0:50 UTC
14 points
5 comments2 min readLW link

Do LLMs know what they’re ca­pa­ble of? Why this mat­ters for AI safety, and ini­tial findings

13 Jul 2025 19:54 UTC
53 points
5 comments18 min readLW link

You Can’t Skip Ex­plo­ra­tion: Why un­der­stand­ing ex­per­i­men­ta­tion and taste is key to un­der­stand­ing AI

Oliver Sourbut21 May 2025 16:08 UTC
20 points
0 comments11 min readLW link
(www.oliversourbut.net)

FLF Fel­low­ship on AI for Hu­man Rea­son­ing: $25-50k, 12 weeks

19 May 2025 13:25 UTC
76 points
1 comment2 min readLW link
(www.flf.org)

De­cep­tive Align­ment and Homuncularity

16 Jan 2025 13:55 UTC
26 points
12 comments22 min readLW link

Co­op­er­a­tion and Align­ment in Del­e­ga­tion Games: You Need Both!

3 Aug 2024 10:16 UTC
9 points
0 comments14 min readLW link
(www.oliversourbut.net)

[Question] Ter­minol­ogy: <some­thing>-ware for ML?

Oliver Sourbut3 Jan 2024 11:42 UTC
17 points
27 comments1 min readLW link

Align­ment, con­flict, powerseeking

Oliver Sourbut22 Nov 2023 9:47 UTC
7 points
1 comment1 min readLW link

Care­less talk on US-China AI com­pe­ti­tion? (and crit­i­cism of CAIS cov­er­age)

Oliver Sourbut20 Sep 2023 12:46 UTC
18 points
3 comments10 min readLW link3 reviews
(www.oliversourbut.net)

In­vad­ing Aus­tralia (End­less Former­lies Most Beau­tiful, or What I Learned On My Holi­day)

Oliver Sourbut8 Sep 2023 15:33 UTC
14 points
1 comment8 min readLW link
(www.oliversourbut.net)

Hert­ford, Sour­but (ra­tio­nal­ity les­sons from Univer­sity Challenge)

Oliver Sourbut4 Sep 2023 18:44 UTC
30 points
7 comments14 min readLW link
(www.oliversourbut.net)

Un-un­plug­ga­bil­ity—can’t we just un­plug it?

Oliver Sourbut15 May 2023 13:23 UTC
28 points
10 comments12 min readLW link
(www.oliversourbut.net)

Oliver Sour­but’s Shortform

Oliver Sourbut14 Jul 2022 15:39 UTC
6 points
29 comments1 min readLW link

De­liber­a­tion Every­where: Sim­ple Examples

Oliver Sourbut27 Jun 2022 17:26 UTC
28 points
3 comments15 min readLW link