Ad­miring the Guts of Things.

Melkor11 Jun 2018 23:12 UTC
21 points
1 comment3 min readLW link

A gen­eral model of safety-ori­ented AI development

Wei Dai11 Jun 2018 21:00 UTC
65 points
8 comments1 min readLW link

Thoughts on the In­ner Bruce

LeoHolman11 Jun 2018 20:18 UTC
12 points
2 comments3 min readLW link

An­nounc­ing the sec­ond AI Safety Camp

Lachouette11 Jun 2018 18:59 UTC
34 points
0 comments1 min readLW link

The Align­ment Newslet­ter #10: 06/​11/​18

Rohin Shah11 Jun 2018 16:00 UTC
16 points
0 comments9 min readLW link

Front Row Center

Zvi11 Jun 2018 13:50 UTC
30 points
12 comments2 min readLW link
(thezvi.wordpress.com)

A Loop­hole for Self-Ap­plica­tive Soundness

Diffractor11 Jun 2018 7:57 UTC
2 points
4 comments2 min readLW link

AI and the pa­per­clip prob­lem (or: Economist solves con­trol prob­lem with one weird trick!)

fortyeridania11 Jun 2018 2:19 UTC
10 points
4 comments1 min readLW link
(voxeu.org)

Oops on Com­mod­ity Prices

sarahconstantin10 Jun 2018 15:40 UTC
148 points
8 comments2 min readLW link
(srconstantin.wordpress.com)

Re­solv­ing the Dr Evil Problem

Chris_Leong10 Jun 2018 11:56 UTC
10 points
8 comments3 min readLW link

Sim­plified Poker Conclusions

Zvi9 Jun 2018 21:50 UTC
64 points
2 comments5 min readLW link
(thezvi.wordpress.com)

Fun­da­men­tals of For­mal­i­sa­tion Level 3: Set The­o­retic Re­la­tions and Enumerability

philip_b9 Jun 2018 19:57 UTC
16 points
0 comments1 min readLW link

Un­rav­el­ing the Failure’s Try

LeoHolman9 Jun 2018 14:34 UTC
9 points
11 comments2 min readLW link

Physics has laws, the Uni­verse might not

shminux9 Jun 2018 5:33 UTC
25 points
23 comments3 min readLW link

Could we send a mes­sage to the dis­tant fu­ture?

paulfchristiano9 Jun 2018 4:27 UTC
37 points
23 comments3 min readLW link

RFC: Meta-eth­i­cal un­cer­tainty in AGI alignment

Gordon Seidoh Worley8 Jun 2018 20:56 UTC
16 points
6 comments3 min readLW link

De­scribing LessWrong in one paragraph

ChristianKl8 Jun 2018 20:54 UTC
16 points
6 comments1 min readLW link

Quan­tum AI Goal

Gurkenglas8 Jun 2018 16:55 UTC
−1 points
5 comments1 min readLW link

Quan­tum AI Box

Gurkenglas8 Jun 2018 16:20 UTC
4 points
15 comments1 min readLW link

Effec­tive Altru­ism as Global Catas­tro­phe Mitigation

Evan_Gaensbauer8 Jun 2018 4:17 UTC
9 points
0 comments22 min readLW link

Poker ex­am­ple: (not) de­duc­ing some­one’s preferences

Stuart_Armstrong8 Jun 2018 3:19 UTC
16 points
5 comments3 min readLW link

The In­co­her­ence of Honesty

Gordon Seidoh Worley8 Jun 2018 2:28 UTC
20 points
16 comments3 min readLW link

Reflec­tions on Berkeley REACH

stardust8 Jun 2018 0:02 UTC
123 points
9 comments14 min readLW link

Beyond Astro­nom­i­cal Waste

Wei Dai7 Jun 2018 21:04 UTC
125 points
41 comments3 min readLW link

The first AI Safety Camp & onwards

Remmelt7 Jun 2018 20:13 UTC
46 points
0 comments8 min readLW link

A Ra­tion­al­ist Ar­gu­ment for Voting

Jameson Quinn7 Jun 2018 17:05 UTC
11 points
31 comments3 min readLW link

How to in­tro Effec­tive Altruism

ChaosHufflepuff7 Jun 2018 10:24 UTC
5 points
5 comments1 min readLW link

Wash­ing­ton, D.C.: What Have You Read Re­cently?

RobinZ7 Jun 2018 2:30 UTC
8 points
0 comments1 min readLW link

Glide #1.5: New Crit­i­cism and Rationality

musicmage41147 Jun 2018 0:36 UTC
6 points
2 comments3 min readLW link

Hufflepuff Lead­er­ship and Fight­ing Entropy

Raemon7 Jun 2018 0:28 UTC
51 points
2 comments4 min readLW link

Bug Report

Evan_Gaensbauer6 Jun 2018 21:41 UTC
10 points
6 comments1 min readLW link

Monty Hall in the Wild

Jacob Falkovich6 Jun 2018 18:03 UTC
24 points
9 comments6 min readLW link

Sim­plified Poker Strategy

Zvi6 Jun 2018 11:10 UTC
40 points
0 comments2 min readLW link
(thezvi.wordpress.com)

Re­source-Limited Reflec­tive Oracles

Diffractor6 Jun 2018 2:50 UTC
5 points
1 comment4 min readLW link

Disam­biguat­ing “al­ign­ment” and re­lated no­tions

David Scott Krueger (formerly: capybaralet)5 Jun 2018 15:35 UTC
22 points
21 comments2 min readLW link

The Arc of Time

notetofutureself5 Jun 2018 6:21 UTC
−19 points
1 comment1 min readLW link

Pri­son­ers’ Dilemma with Costs to Modeling

Scott Garrabrant5 Jun 2018 4:51 UTC
123 points
20 comments7 min readLW link

A line of defense against un­friendly out­comes: Grover’s Algorithm

Gurkenglas5 Jun 2018 0:59 UTC
2 points
0 comments3 min readLW link

The Align­ment Newslet­ter #9: 06/​04/​18

Rohin Shah4 Jun 2018 16:00 UTC
8 points
0 comments2 min readLW link

Sim­plified Poker

Zvi4 Jun 2018 15:50 UTC
34 points
17 comments1 min readLW link
(thezvi.wordpress.com)

Teach­ing Method­olo­gies & Techniques

ChaosHufflepuff4 Jun 2018 11:33 UTC
9 points
10 comments1 min readLW link

The less­wrong slack—an in­tro­duc­tion to our regulars

Elo4 Jun 2018 6:29 UTC
29 points
2 comments6 min readLW link

Against ac­cus­ing peo­ple of motte and bailey

Kaj_Sotala3 Jun 2018 21:31 UTC
42 points
14 comments4 min readLW link

Ex­ces­sive EDA Effortposting

abstractapplic3 Jun 2018 19:17 UTC
44 points
2 comments10 min readLW link

Us­ing In­tel­lec­tual Pro­cesses to Com­bat Bias

JustinCEO3 Jun 2018 14:42 UTC
−25 points
11 comments1 min readLW link
(rationalessays.com)

Swim­ming Up­stream: A Case Study in In­stru­men­tal Rationality

TurnTrout3 Jun 2018 3:16 UTC
76 points
7 comments8 min readLW link

Trajectory

Logan Riggs2 Jun 2018 18:29 UTC
6 points
0 comments2 min readLW link

Why kids stop ask­ing why

Expipiplusone2 Jun 2018 17:03 UTC
3 points
8 comments1 min readLW link

Sleep­ing Beauty Re­solved (?) Pt. 2: Iden­tity and Betting

ksvanhorn2 Jun 2018 2:43 UTC
9 points
50 comments6 min readLW link

Three types of “should”

Sniffnoy2 Jun 2018 0:54 UTC
9 points
9 comments2 min readLW link