All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 201720182019 2020 2021 2022 2023 2024 2025 2026

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Admiring the Guts of Things.

Melkor11 Jun 2018 23:12 UTC

22 points

1 comment3 min readLW link

A general model of safety-oriented AI development

Wei Dai11 Jun 2018 21:00 UTC

68 points

8 comments1 min readLW link

Thoughts on the Inner Bruce

LeoHolman11 Jun 2018 20:18 UTC

12 points

2 comments3 min readLW link

Announcing the second AI Safety Camp

Lachouette11 Jun 2018 18:59 UTC

34 points

0 comments1 min readLW link

The Alignment Newsletter #10: 06/11/18

Rohin Shah11 Jun 2018 16:00 UTC

16 points

0 comments9 min readLW link

Front Row Center

Zvi11 Jun 2018 13:50 UTC

31 points

12 comments2 min readLW link

(thezvi.wordpress.com)

A Loophole for Self-Applicative Soundness

Diffractor11 Jun 2018 7:57 UTC

2 points

4 comments2 min readLW link

AI and the paperclip problem (or: Economist solves control problem with one weird trick!)

fortyeridania11 Jun 2018 2:19 UTC

10 points

4 comments1 min readLW link

(voxeu.org)

Oops on Commodity Prices

sarahconstantin10 Jun 2018 15:40 UTC

148 points

8 comments2 min readLW link

(srconstantin.wordpress.com)

Resolving the Dr Evil Problem

Chris_Leong10 Jun 2018 11:56 UTC

10 points

8 comments3 min readLW link

Simplified Poker Conclusions

Zvi9 Jun 2018 21:50 UTC

65 points

2 comments5 min readLW link

(thezvi.wordpress.com)

Fundamentals of Formalisation Level 3: Set Theoretic Relations and Enumerability

philip_b9 Jun 2018 19:57 UTC

16 points

0 comments1 min readLW link

Unraveling the Failure’s Try

LeoHolman9 Jun 2018 14:34 UTC

9 points

11 comments2 min readLW link

Physics has laws, the Universe might not

Shmi9 Jun 2018 5:33 UTC

25 points

23 comments3 min readLW link

Could we send a message to the distant future?

paulfchristiano9 Jun 2018 4:27 UTC

37 points

23 comments3 min readLW link

RFC: Meta-ethical uncertainty in AGI alignment

Gordon Seidoh Worley8 Jun 2018 20:56 UTC

16 points

6 comments3 min readLW link

Describing LessWrong in one paragraph

ChristianKl8 Jun 2018 20:54 UTC

16 points

6 comments1 min readLW link

Quantum AI Goal

Gurkenglas8 Jun 2018 16:55 UTC

−1 points

5 comments1 min readLW link

Quantum AI Box

Gurkenglas8 Jun 2018 16:20 UTC

4 points

15 comments1 min readLW link

Effective Altruism as Global Catastrophe Mitigation

Evan_Gaensbauer8 Jun 2018 4:17 UTC

9 points

0 comments22 min readLW link

Poker example: (not) deducing someone’s preferences

Stuart_Armstrong8 Jun 2018 3:19 UTC

16 points

5 comments3 min readLW link

The Incoherence of Honesty

Gordon Seidoh Worley8 Jun 2018 2:28 UTC

20 points

16 comments3 min readLW link

Reflections on Berkeley REACH

stardust8 Jun 2018 0:02 UTC

123 points

9 comments14 min readLW link

Beyond Astronomical Waste

Wei Dai7 Jun 2018 21:04 UTC

150 points

41 comments3 min readLW link

The first AI Safety Camp & onwards

Remmelt7 Jun 2018 20:13 UTC

46 points

0 comments8 min readLW link

A Rationalist Argument for Voting

Jameson Quinn7 Jun 2018 17:05 UTC

11 points

31 comments3 min readLW link

How to intro Effective Altruism

ChaosHufflepuff7 Jun 2018 10:24 UTC

5 points

5 comments1 min readLW link

Washington, D.C.: What Have You Read Recently?

RobinZ7 Jun 2018 2:30 UTC

8 points

0 comments1 min readLW link

Glide #1.5: New Criticism and Rationality

musicmage41147 Jun 2018 0:36 UTC

6 points

2 comments3 min readLW link

Hufflepuff Leadership and Fighting Entropy

Raemon7 Jun 2018 0:28 UTC

51 points

2 comments4 min readLW link

Bug Report

Evan_Gaensbauer6 Jun 2018 21:41 UTC

10 points

6 comments1 min readLW link

Monty Hall in the Wild

Jacob Falkovich6 Jun 2018 18:03 UTC

24 points

9 comments6 min readLW link

Simplified Poker Strategy

Zvi6 Jun 2018 11:10 UTC

40 points

0 comments2 min readLW link

(thezvi.wordpress.com)

Resource-Limited Reflective Oracles

Diffractor6 Jun 2018 2:50 UTC

16 points

2 comments4 min readLW link

Disambiguating “alignment” and related notions

David Scott Krueger (formerly: capybaralet)5 Jun 2018 15:35 UTC

22 points

21 comments2 min readLW link

The Arc of Time

notetofutureself5 Jun 2018 6:21 UTC

−19 points

1 comment1 min readLW link

Prisoners’ Dilemma with Costs to Modeling

Scott Garrabrant5 Jun 2018 4:51 UTC

123 points

20 comments7 min readLW link

A line of defense against unfriendly outcomes: Grover’s Algorithm

Gurkenglas5 Jun 2018 0:59 UTC

2 points

0 comments3 min readLW link

The Alignment Newsletter #9: 06/04/18

Rohin Shah4 Jun 2018 16:00 UTC

8 points

0 comments2 min readLW link

Simplified Poker

Zvi4 Jun 2018 15:50 UTC

36 points

17 comments1 min readLW link

(thezvi.wordpress.com)

Teaching Methodologies & Techniques

ChaosHufflepuff4 Jun 2018 11:33 UTC

9 points

10 comments1 min readLW link

The lesswrong slack—an introduction to our regulars

Elo4 Jun 2018 6:29 UTC

29 points

2 comments6 min readLW link

Against accusing people of motte and bailey

Kaj_Sotala3 Jun 2018 21:31 UTC

43 points

14 comments4 min readLW link

Excessive EDA Effortposting

abstractapplic3 Jun 2018 19:17 UTC

44 points

2 comments10 min readLW link

Using Intellectual Processes to Combat Bias

JustinCEO3 Jun 2018 14:42 UTC

−25 points

11 comments1 min readLW link

(rationalessays.com)

Swimming Upstream: A Case Study in Instrumental Rationality

TurnTrout3 Jun 2018 3:16 UTC

77 points

7 comments8 min readLW link

Trajectory

Logan Riggs2 Jun 2018 18:29 UTC

6 points

0 comments2 min readLW link

Why kids stop asking why

Expipiplusone2 Jun 2018 17:03 UTC

3 points

8 comments1 min readLW link

Sleeping Beauty Resolved (?) Pt. 2: Identity and Betting

ksvanhorn2 Jun 2018 2:43 UTC

9 points

50 comments6 min readLW link

Three types of “should”

Sniffnoy2 Jun 2018 0:54 UTC

9 points

9 comments2 min readLW link