The Alignment Newsletter #11: 06/18/18

Rohin Shah 18 Jun 2018 16:00 UTC

8 points

0 comments 10 min read LW link

Why Destructive Value Capture?

Zvi 18 Jun 2018 12:20 UTC

29 points

13 comments 4 min read LW link

(thezvi.wordpress.com)

Privacy: Defining Yourself

Lulie 18 Jun 2018 2:15 UTC

39 points

5 comments 3 min read LW link

In Defense of Ambiguous Problems

Chris_Leong 17 Jun 2018 7:40 UTC

6 points

6 comments 2 min read LW link

Fundamentals of Formalisation Level 4: Formal Semantics Basics

philip_b 16 Jun 2018 19:09 UTC

12 points

0 comments 1 min read LW link

How many philosophers accept the orthogonality thesis ? Evidence from the PhilPapers survey

Paperclip Minimizer 16 Jun 2018 12:11 UTC

3 points

26 comments 3 min read LW link

Worrying about the Vase: Whitelisting

TurnTrout 16 Jun 2018 2:17 UTC

73 points

26 comments 11 min read LW link

Merging accounts

Chris_Leong 16 Jun 2018 0:45 UTC

5 points

2 comments 1 min read LW link

The Curious Prisoner Puzzle

Chris_Leong 16 Jun 2018 0:40 UTC

4 points

14 comments 1 min read LW link

Geoffrey Miller on Effective Altruism and Rationality

Jacob Falkovich 15 Jun 2018 17:05 UTC

19 points

0 comments 1 min read LW link

(putanumonit.com)

Aligned AI May Depend on Moral Facts

Gordon Seidoh Worley 15 Jun 2018 1:33 UTC

8 points

11 comments 1 min read LW link

SIAM Lecture: How Paradoxes Shape Mathematics and Give Us Self-Verifying Computer Programs

ldsrrs 14 Jun 2018 20:58 UTC

3 points

0 comments 1 min read LW link

(meetings.siam.org)

We Agree: Speeches All Around!

SebastianG 14 Jun 2018 17:53 UTC

37 points

19 comments 2 min read LW link

Weak arguments against the universal prior being malign

X4vier 14 Jun 2018 17:11 UTC

50 points

23 comments 3 min read LW link

Notes on a recent wave of spam

rossry 14 Jun 2018 15:39 UTC

11 points

2 comments 1 min read LW link

Logical Inductor Tiling and Why it’s Hard

Diffractor 14 Jun 2018 6:34 UTC

4 points

0 comments 12 min read LW link

Washington, D.C.: Anxiety

RobinZ 14 Jun 2018 1:17 UTC

15 points

0 comments 1 min read LW link

Anthropics made easy?

Stuart_Armstrong 14 Jun 2018 0:56 UTC

32 points

61 comments 6 min read LW link

Counterfactual Mugging Poker Game

Scott Garrabrant 13 Jun 2018 23:34 UTC

134 points

4 comments 1 min read LW link

On the Chatham House Rule

Scott Garrabrant 13 Jun 2018 21:41 UTC

69 points

25 comments 4 min read LW link 1 review

[Math] Towards Proof Writing as a Skill In Itself

Andrew Quinn 13 Jun 2018 4:39 UTC

25 points

8 comments 2 min read LW link

Today a Tragedy

Logan Riggs 13 Jun 2018 1:58 UTC

54 points

17 comments 1 min read LW link

Epistemological Braces

musicmage4114 12 Jun 2018 22:01 UTC

1 point

2 comments 6 min read LW link

LW Update 2018-06-11 – Vulcan Refactor, Karma Overhaul, Colored Links, Moderation Log

Raemon 12 Jun 2018 0:49 UTC

32 points

34 comments 3 min read LW link

Admiring the Guts of Things.

Melkor 11 Jun 2018 23:12 UTC

22 points

1 comment 3 min read LW link

A general model of safety-oriented AI development

Wei Dai 11 Jun 2018 21:00 UTC

68 points

8 comments 1 min read LW link

Thoughts on the Inner Bruce

LeoHolman 11 Jun 2018 20:18 UTC

12 points

2 comments 3 min read LW link

Announcing the second AI Safety Camp

Lachouette 11 Jun 2018 18:59 UTC

34 points

0 comments 1 min read LW link

The Alignment Newsletter #10: 06/11/18

Rohin Shah 11 Jun 2018 16:00 UTC

16 points

0 comments 9 min read LW link

Front Row Center

Zvi 11 Jun 2018 13:50 UTC

31 points

12 comments 2 min read LW link

(thezvi.wordpress.com)

A Loophole for Self-Applicative Soundness

Diffractor 11 Jun 2018 7:57 UTC

2 points

4 comments 2 min read LW link

AI and the paperclip problem (or: Economist solves control problem with one weird trick!)

fortyeridania 11 Jun 2018 2:19 UTC

10 points

4 comments 1 min read LW link

(voxeu.org)

Oops on Commodity Prices

sarahconstantin 10 Jun 2018 15:40 UTC

148 points

8 comments 2 min read LW link

(srconstantin.wordpress.com)

Resolving the Dr Evil Problem

Chris_Leong 10 Jun 2018 11:56 UTC

10 points

8 comments 3 min read LW link

Simplified Poker Conclusions

Zvi 9 Jun 2018 21:50 UTC

65 points

2 comments 5 min read LW link

(thezvi.wordpress.com)

Fundamentals of Formalisation Level 3: Set Theoretic Relations and Enumerability

philip_b 9 Jun 2018 19:57 UTC

16 points

0 comments 1 min read LW link

Unraveling the Failure’s Try

LeoHolman 9 Jun 2018 14:34 UTC

9 points

11 comments 2 min read LW link

Physics has laws, the Universe might not

Shmi 9 Jun 2018 5:33 UTC

25 points

23 comments 3 min read LW link

Could we send a message to the distant future?

paulfchristiano 9 Jun 2018 4:27 UTC

37 points

23 comments 3 min read LW link

RFC: Meta-ethical uncertainty in AGI alignment

Gordon Seidoh Worley 8 Jun 2018 20:56 UTC

16 points

6 comments 3 min read LW link

Describing LessWrong in one paragraph

ChristianKl 8 Jun 2018 20:54 UTC

16 points

6 comments 1 min read LW link

Quantum AI Goal

Gurkenglas 8 Jun 2018 16:55 UTC

−1 points

5 comments 1 min read LW link

Quantum AI Box

Gurkenglas 8 Jun 2018 16:20 UTC

4 points

15 comments 1 min read LW link

Effective Altruism as Global Catastrophe Mitigation

Evan_Gaensbauer 8 Jun 2018 4:17 UTC

9 points

0 comments 22 min read LW link

Poker example: (not) deducing someone’s preferences

Stuart_Armstrong 8 Jun 2018 3:19 UTC

16 points

5 comments 3 min read LW link

The Incoherence of Honesty

Gordon Seidoh Worley 8 Jun 2018 2:28 UTC

20 points

16 comments 3 min read LW link

Reflections on Berkeley REACH

stardust 8 Jun 2018 0:02 UTC

123 points

9 comments 14 min read LW link

Beyond Astronomical Waste

Wei Dai 7 Jun 2018 21:04 UTC

150 points

41 comments 3 min read LW link

The first AI Safety Camp & onwards

Remmelt 7 Jun 2018 20:13 UTC

46 points

0 comments 8 min read LW link

A Rationalist Argument for Voting

Jameson Quinn 7 Jun 2018 17:05 UTC

11 points

31 comments 3 min read LW link