RSS

Richard_Ngo

Karma: 18,789

Formerly alignment and governance researcher at DeepMind and OpenAI. Now independent.

Well-found­ed­ness as an or­ga­niz­ing prin­ci­ple of healthy minds and societies

Richard_Ngo7 Apr 2025 0:31 UTC
35 points
7 comments6 min readLW link
(www.mindthefuture.info)

Third-wave AI safety needs so­ciopoli­ti­cal thinking

Richard_Ngo27 Mar 2025 0:55 UTC
99 points
23 comments26 min readLW link

Towards a scale-free the­ory of in­tel­li­gent agency

Richard_Ngo21 Mar 2025 1:39 UTC
96 points
44 comments13 min readLW link
(www.mindthefuture.info)

Elite Co­or­di­na­tion via the Con­sen­sus of Power

Richard_Ngo19 Mar 2025 6:56 UTC
92 points
15 comments12 min readLW link
(www.mindthefuture.info)

Tro­jan Sky

Richard_Ngo11 Mar 2025 3:14 UTC
245 points
39 comments12 min readLW link
(www.narrativeark.xyz)

Power Lies Trem­bling: a three-book review

Richard_Ngo22 Feb 2025 22:57 UTC
213 points
27 comments15 min readLW link
(www.mindthefuture.info)

The Gen­tle Romance

Richard_Ngo19 Jan 2025 18:29 UTC
242 points
46 comments15 min readLW link
(www.asimov.press)

From the Archives: a story

Richard_Ngo27 Dec 2024 16:36 UTC
20 points
1 comment16 min readLW link
(www.narrativeark.xyz)

Epistemic sta­tus: po­etry (and other po­ems)

Richard_Ngo21 Nov 2024 18:13 UTC
51 points
5 comments2 min readLW link
(www.narrativeark.xyz)

Why I’m not a Bayesian

Richard_Ngo6 Oct 2024 15:22 UTC
211 points
101 comments10 min readLW link
(www.mindthefuture.info)

Defin­ing al­ign­ment research

Richard_Ngo19 Aug 2024 20:42 UTC
92 points
23 comments7 min readLW link

Green and golden: a meditation

Richard_Ngo18 Aug 2024 1:36 UTC
21 points
0 comments3 min readLW link
(www.narrativeark.xyz)

Twit­ter thread on open-source AI

Richard_Ngo31 Jul 2024 0:26 UTC
33 points
6 comments2 min readLW link
(x.com)

Twit­ter thread on AI takeover scenarios

Richard_Ngo31 Jul 2024 0:24 UTC
37 points
0 comments2 min readLW link
(x.com)

Twit­ter thread on AI safety evals

Richard_Ngo31 Jul 2024 0:18 UTC
63 points
3 comments2 min readLW link
(x.com)

Twit­ter thread on poli­tics of AI safety

Richard_Ngo31 Jul 2024 0:00 UTC
35 points
2 comments1 min readLW link
(x.com)

Coal­i­tional agency

Richard_Ngo22 Jul 2024 0:09 UTC
56 points
6 comments6 min readLW link

A more sys­tem­atic case for in­ner misalignment

Richard_Ngo20 Jul 2024 5:03 UTC
31 points
4 comments5 min readLW link

Towards more co­op­er­a­tive AI safety strategies

Richard_Ngo16 Jul 2024 4:36 UTC
215 points
133 comments4 min readLW link

A sim­ple case for ex­treme in­ner misalignment

Richard_Ngo13 Jul 2024 15:40 UTC
84 points
41 comments7 min readLW link