Power Buys You Distance From The Crime

Elizabeth · 2 Aug 2019 20:50 UTC

219 points

75 comments · 7 min read · LW link · 1 review

(acesounderglass.com)

Why Subagents?

johnswentworth · 1 Aug 2019 22:17 UTC

179 points

50 comments · 7 min read · LW link · 1 review

The Commitment Races problem

Daniel Kokotajlo · 23 Aug 2019 1:58 UTC

176 points

56 comments · 5 min read · LW link

Soft takeoff can still lead to decisive strategic advantage

Daniel Kokotajlo · 23 Aug 2019 16:39 UTC

122 points

47 comments · 8 min read · LW link · 4 reviews

Subagents, trauma and rationality

Kaj_Sotala · 14 Aug 2019 13:14 UTC

113 points

4 comments · 19 min read · LW link

Trauma, Meditation, and a Cool Scar

Logan Riggs · 6 Aug 2019 16:17 UTC

102 points

17 comments · 5 min read · LW link · 1 review

[Question] Can we really prevent all warming for less than 10B$ with the mostly side-effect free geoengineering technique of Marine Cloud Brightening?

mako yass · 5 Aug 2019 0:12 UTC

96 points

55 comments · 2 min read · LW link

Partial summary of debate with Benquo and Jessicata [pt 1]

Raemon · 14 Aug 2019 20:02 UTC

89 points

63 comments · 22 min read · LW link · 3 reviews

Troll Bridge

abramdemski · 23 Aug 2019 18:36 UTC

88 points

59 comments · 12 min read · LW link

Subagents, neural Turing machines, thought selection, and blindspots

Kaj_Sotala · 6 Aug 2019 21:15 UTC

87 points

3 comments · 12 min read · LW link

Problems in AI Alignment that philosophers could potentially contribute to

Wei Dai · 17 Aug 2019 17:38 UTC

86 points

14 comments · 2 min read · LW link

2-D Robustness

Vlad Mikulik · 30 Aug 2019 20:27 UTC

86 points

8 comments · 2 min read · LW link

Clarifying some key hypotheses in AI alignment

Ben Cottier and Rohin Shah · 15 Aug 2019 21:29 UTC

79 points

12 comments · 9 min read · LW link

Markets are Universal for Logical Induction

johnswentworth · 22 Aug 2019 6:44 UTC

78 points

3 comments · 5 min read · LW link

Six AI Risk/Strategy Ideas

Wei Dai · 27 Aug 2019 0:40 UTC

73 points

17 comments · 4 min read · LW link · 1 review

Classifying specification problems as variants of Goodhart’s Law

Vika · 19 Aug 2019 20:40 UTC

72 points

5 comments · 5 min read · LW link · 1 review

[Question] Does Agent-like Behavior Imply Agent-like Architecture?

Scott Garrabrant · 23 Aug 2019 2:01 UTC

72 points

9 comments · 1 min read · LW link

Response to Glen Weyl on Technocracy and the Rationalist Community

John_Maxwell · 22 Aug 2019 23:14 UTC

66 points

9 comments · 10 min read · LW link

[Question] Why so much variance in human intelligence?

Ben Pace · 22 Aug 2019 22:36 UTC

65 points

28 comments · 4 min read · LW link

Book Review: Secular Cycles

Scott Alexander · 13 Aug 2019 4:10 UTC

63 points

10 comments · 16 min read · LW link · 1 review

(slatestarcodex.com)

Dual Wielding

Zvi · 27 Aug 2019 14:10 UTC

60 points

23 comments · 2 min read · LW link · 3 reviews

(thezvi.wordpress.com)

How to Make Billions of Dollars Reducing Loneliness

John_Maxwell · 30 Aug 2019 17:30 UTC

60 points

32 comments · 7 min read · LW link

Schelling Categories, and Simple Membership Tests

Zack_M_Davis · 26 Aug 2019 2:43 UTC

60 points

10 comments · 8 min read · LW link

Tabooing ‘Agent’ for Prosaic Alignment

Hjalmar_Wijk · 23 Aug 2019 2:55 UTC

57 points

10 comments · 6 min read · LW link

Zeno walks into a bar

lsusr · 4 Aug 2019 7:00 UTC

56 points

4 comments · 2 min read · LW link

Actually updating

SaraHax · 23 Aug 2019 17:46 UTC

56 points

10 comments · 4 min read · LW link

Intentional Bucket Errors

Scott Garrabrant · 22 Aug 2019 20:02 UTC

55 points

6 comments · 3 min read · LW link

Computational Model: Causal Diagrams with Symmetry

johnswentworth · 22 Aug 2019 17:54 UTC

54 points

31 comments · 4 min read · LW link

Permissions in Governance

sarahconstantin · 2 Aug 2019 19:50 UTC

53 points

12 comments · 8 min read · LW link

(srconstantin.wordpress.com)

A Personal Rationality Wishlist

DanielFilan · 27 Aug 2019 3:40 UTC

53 points

54 comments · 4 min read · LW link

(danielfilan.com)

AI Forecasting Dictionary (Forecasting infrastructure, part 1)

Bird Concept and Ben Goldhaber · 8 Aug 2019 16:10 UTC

50 points

0 comments · 5 min read · LW link

Vaniver’s View on Factored Cognition

Vaniver · 23 Aug 2019 2:54 UTC

48 points

4 comments · 8 min read · LW link

Status 451 on Diagnosis: Russell Aphasia

Zack_M_Davis · 6 Aug 2019 4:43 UTC

48 points

1 comment · 1 min read · LW link

(status451.com)

Towards a mechanistic understanding of corrigibility

evhub · 22 Aug 2019 23:20 UTC

47 points

26 comments · 4 min read · LW link

September Bragging Thread

Raemon · 30 Aug 2019 21:58 UTC

47 points

12 comments · 1 min read · LW link

[Link] Book Review: Reframing Superintelligence (SSC)

ioannes · 28 Aug 2019 22:57 UTC

46 points

9 comments · 2 min read · LW link

[Question] How Can People Evaluate Complex Questions Consistently?

Elizabeth · 26 Aug 2019 20:33 UTC

46 points

12 comments · 1 min read · LW link

New paper: Corrigibility with Utility Preservation

Koen.Holtman · 6 Aug 2019 19:04 UTC

44 points

11 comments · 2 min read · LW link

Embedded Agency via Abstraction

johnswentworth · 26 Aug 2019 23:03 UTC

42 points

20 comments · 11 min read · LW link

Cephaloponderings

Jacob Falkovich · 4 Aug 2019 16:45 UTC

41 points

4 comments · 7 min read · LW link

My recommendations for gratitude exercises

MaxCarpendale · 5 Aug 2019 19:04 UTC

40 points

3 comments · 5 min read · LW link

The Missing Math of Map-Making

johnswentworth · 28 Aug 2019 21:18 UTC

40 points

8 comments · 2 min read · LW link

LW Team Updates—September 2019

Ruby · 29 Aug 2019 22:12 UTC

39 points

13 comments · 2 min read · LW link

Epistemic Spot Check: The Fate of Rome (Kyle Harper)

Elizabeth · 24 Aug 2019 21:40 UTC

39 points

3 comments · 5 min read · LW link

(acesounderglass.com)

Call for contributors to the Alignment Newsletter

Rohin Shah · 21 Aug 2019 18:21 UTC

39 points

0 comments · 4 min read · LW link

Optimization Provenance

Adele Lopez · 23 Aug 2019 20:08 UTC

38 points

5 comments · 5 min read · LW link

Unstriving

Jacob Falkovich · 19 Aug 2019 14:31 UTC

38 points

7 comments · 6 min read · LW link

Diana Fleischman and Geoffrey Miller—Audience Q&A

Jacob Falkovich · 10 Aug 2019 22:37 UTC

38 points

6 comments · 9 min read · LW link

When do utility functions constrain?

Hoagy · 23 Aug 2019 17:19 UTC

37 points

8 comments · 7 min read · LW link

Mistake Versus Conflict Theory of Against Billionaire Philanthropy

Zvi · 1 Aug 2019 13:10 UTC

37 points

34 comments · 3 min read · LW link

(thezvi.wordpress.com)