All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Contra shard theory, in the context of the diamond maximizer problem

So8res13 Oct 2022 23:51 UTC

107 points

19 comments2 min readLW link 1 review

Greed Is the Root of This Evil

Thane Ruthenis13 Oct 2022 20:40 UTC

21 points

7 comments8 min readLW link

Vehicle Platooning—a real world examination of the difficulties in coordination

M. Y. Zuo13 Oct 2022 19:33 UTC

24 points

6 comments2 min readLW link

The Vitalik Buterin Fellowship in AI Existential Safety is open for applications!

Xin Chen, Cynthia13 Oct 2022 18:32 UTC

21 points

0 comments2 min readLW link

Feelings

Eris Discordia13 Oct 2022 17:48 UTC

9 points

0 comments9 min readLW link

Against the normative realist’s wager

Joe Carlsmith13 Oct 2022 16:35 UTC

16 points

9 comments23 min readLW link

Weekly Non-Covid News #1 (10/13/22)

Zvi13 Oct 2022 15:40 UTC

52 points

16 comments16 min readLW link

(thezvi.wordpress.com)

Misalignment-by-default in multi-agent systems

Edouard Harris and simonsdsuo

13 Oct 2022 15:38 UTC

21 points

8 comments20 min readLW link

(www.gladstone.ai)

A stubborn unbeliever finally gets the depth of the AI alignment problem

aelwood13 Oct 2022 15:16 UTC

17 points

8 comments3 min readLW link

(pursuingreality.substack.com)

Covid 10/13/22: Just the Facts

Zvi13 Oct 2022 14:40 UTC

28 points

7 comments10 min readLW link

(thezvi.wordpress.com)

When should you defer to expertise? A useful heuristic (Crosspost from EA forum)

Noosphere8913 Oct 2022 14:14 UTC

9 points

3 comments2 min readLW link

(forum.effectivealtruism.org)

Cataloguing Priors in Theory and Practice

Paul Bricman13 Oct 2022 12:36 UTC

13 points

8 comments7 min readLW link

Transformative VR Is Likely Coming Soon

jimrandomh13 Oct 2022 6:25 UTC

90 points

47 comments2 min readLW link

Cambridge LW Meetup: See the Invisible

Tony Wang13 Oct 2022 5:44 UTC

1 point

0 comments1 min readLW link

Glossary Dance Game

jefftk13 Oct 2022 2:20 UTC

10 points

1 comment2 min readLW link

(www.jefftk.com)

Niceness is unnatural

So8res13 Oct 2022 1:30 UTC

136 points

20 comments8 min readLW link 1 review

A strange twist on the road to AGI

cveres12 Oct 2022 23:27 UTC

−8 points

0 comments1 min readLW link

Help out Redwood Research’s interpretability team by finding heuristics implemented by GPT-2 small

Haoxing Du and Buck

12 Oct 2022 21:25 UTC

50 points

11 comments4 min readLW link

Towards a comprehensive study of potential psychological causes of the ordinary range of variation of affective gender identity in males

tailcalled12 Oct 2022 21:10 UTC

56 points

7 comments37 min readLW link

Six (and a half) intuitions for KL divergence

CallumMcDougall12 Oct 2022 21:07 UTC

185 points

27 comments10 min readLW link 1 review

(www.perfectlynormal.co.uk)

[MLSN #6]: Transparency survey, provable robustness, ML models that predict the future

Dan H12 Oct 2022 20:56 UTC

27 points

0 comments6 min readLW link

[Question] Previous Work on Recreating Neural Network Input from Intermediate Layer Activations

bglass12 Oct 2022 19:28 UTC

1 point

3 comments1 min readLW link

Be more effective by learning important practical knowledge using flashcards

Stenemo12 Oct 2022 18:05 UTC

5 points

2 comments1 min readLW link

Article Review: Google’s AlphaTensor

Robert_AIZI12 Oct 2022 18:04 UTC

8 points

4 comments10 min readLW link

Alignment 201 curriculum

Richard_Ngo12 Oct 2022 18:03 UTC

102 points

3 comments1 min readLW link

(www.agisafetyfundamentals.com)

Progress links and tweets, 2022-10-12

jasoncrawford12 Oct 2022 16:59 UTC

8 points

0 comments1 min readLW link

(rootsofprogress.org)

Building a transformer from scratch—AI safety up-skilling challenge

Marius Hobbhahn12 Oct 2022 15:40 UTC

42 points

1 comment5 min readLW link

Instrumental convergence in single-agent systems

Edouard Harris and simonsdsuo

12 Oct 2022 12:24 UTC

33 points

4 comments8 min readLW link

(www.gladstone.ai)

Singapore—Small casual dinner in Chinatown #5

Joe Rocca12 Oct 2022 8:59 UTC

3 points

1 comment1 min readLW link

A game of mattering

KatjaGrace12 Oct 2022 8:50 UTC

30 points

2 comments5 min readLW link

(worldspiritsockpuppet.com)

Calibration of a thousand predictions

KatjaGrace12 Oct 2022 8:50 UTC

59 points

7 comments5 min readLW link

(worldspiritsockpuppet.com)

My argument against AGI

cveres12 Oct 2022 6:33 UTC

7 points

5 comments3 min readLW link

Actually, All Nuclear Famine Papers are Bunk

Lao Mein12 Oct 2022 5:58 UTC

113 points

37 comments2 min readLW link 1 review

Contingency is not arbitrary

Gordon Seidoh Worley12 Oct 2022 4:35 UTC

13 points

0 comments3 min readLW link

That one apocalyptic nuclear famine paper is bunk

Lao Mein12 Oct 2022 3:33 UTC

111 points

10 comments1 min readLW link

AstralCodexTen and Rationality Meetup Organisers’ Retreat Asia Pacific region

Elo and Harold

12 Oct 2022 3:20 UTC

14 points

4 comments2 min readLW link

Abbots Bromley Horn Dance History

jefftk12 Oct 2022 2:10 UTC

11 points

0 comments2 min readLW link

(www.jefftk.com)

Power-Seeking AI and Existential Risk

Antonio Franca11 Oct 2022 22:50 UTC

7 points

0 comments9 min readLW link

From technocracy to the counterculture

jasoncrawford11 Oct 2022 19:37 UTC

28 points

1 comment26 min readLW link

(rootsofprogress.org)

Prettified AI Safety Game Cards

abramdemski11 Oct 2022 19:35 UTC

47 points

6 comments1 min readLW link

On the proper piloting of flesh shoots

Mordecai Weynberg11 Oct 2022 18:52 UTC

−4 points

6 comments1 min readLW link

Why I think nuclear war triggered by Russian tactical nukes in Ukraine is unlikely

Dave Orr11 Oct 2022 18:30 UTC

50 points

7 comments3 min readLW link

Misalignment Harms Can Be Caused by Low Intelligence Systems

DialecticEel11 Oct 2022 13:39 UTC

11 points

3 comments1 min readLW link

[Sketch] Validity Criterion for Logical Counterfactuals

DragonGod11 Oct 2022 13:31 UTC

6 points

0 comments6 min readLW link

[Question] How much does the risk of dying from nuclear war differ within and between countries?

amarai11 Oct 2022 11:55 UTC

4 points

7 comments1 min readLW link

Did you enjoy Ramez Naam’s “Nexus” trilogy? Check out this interview on neurotech and the law.

fowlertm11 Oct 2022 11:10 UTC

5 points

0 comments1 min readLW link

What “The Message” Was For Me

Alex Beyman11 Oct 2022 8:08 UTC

−3 points

14 comments4 min readLW link

Updates and Clarifications

SD Marlow11 Oct 2022 5:34 UTC

−5 points

1 comment1 min readLW link

What if human reasoning is anti-inductive?

Q Home11 Oct 2022 5:15 UTC

4 points

2 comments13 min readLW link

Fullness to Indicate Cleanliness

jefftk11 Oct 2022 0:40 UTC

9 points

12 comments1 min readLW link

(www.jefftk.com)