The Alignment Newsletter #11: 06/18/18

Rohin Shah · 18 Jun 2018 16:00 UTC
8 points
0 comments · 10 min read · LW link

Why Destructive Value Capture?

Zvi · 18 Jun 2018 12:20 UTC
29 points
13 comments · 4 min read · LW link
(thezvi.wordpress.com)

Privacy: Defining Yourself

Lulie · 18 Jun 2018 2:15 UTC
37 points
5 comments · 3 min read · LW link

In Defense of Ambiguous Problems

Chris_Leong · 17 Jun 2018 7:40 UTC
6 points
6 comments · 2 min read · LW link

Fundamentals of Formalisation Level 4: Formal Semantics Basics

philip_b · 16 Jun 2018 19:09 UTC
12 points
0 comments · 1 min read · LW link

Using the universal prior for logical uncertainty

cousin_it · 16 Jun 2018 14:11 UTC
0 points
0 comments · 1 min read · LW link
(www.greaterwrong.com)

How many philosophers accept the orthogonality thesis? Evidence from the PhilPapers survey

Paperclip Minimizer · 16 Jun 2018 12:11 UTC
3 points
26 comments · 3 min read · LW link

Worrying about the Vase: Whitelisting

TurnTrout · 16 Jun 2018 2:17 UTC
73 points
26 comments · 11 min read · LW link

Merging accounts

Chris_Leong · 16 Jun 2018 0:45 UTC
5 points
2 comments · 1 min read · LW link

The Curious Prisoner Puzzle

Chris_Leong · 16 Jun 2018 0:40 UTC
4 points
14 comments · 1 min read · LW link

Geoffrey Miller on Effective Altruism and Rationality

Jacob Falkovich · 15 Jun 2018 17:05 UTC
19 points
0 comments · 1 min read · LW link
(putanumonit.com)

Aligned AI May Depend on Moral Facts

Gordon Seidoh Worley · 15 Jun 2018 1:33 UTC
8 points
11 comments · 1 min read · LW link

SIAM Lecture: How Paradoxes Shape Mathematics and Give Us Self-Verifying Computer Programs

ldsrrs · 14 Jun 2018 20:58 UTC
3 points
0 comments · 1 min read · LW link
(meetings.siam.org)

We Agree: Speeches All Around!

JohnBuridan · 14 Jun 2018 17:53 UTC
37 points
19 comments · 2 min read · LW link

Weak arguments against the universal prior being malign

X4vier · 14 Jun 2018 17:11 UTC
50 points
23 comments · 3 min read · LW link

Notes on a recent wave of spam

rossry · 14 Jun 2018 15:39 UTC
11 points
2 comments · 1 min read · LW link

Logical Inductor Tiling and Why it’s Hard

Diffractor · 14 Jun 2018 6:34 UTC
3 points
0 comments · 12 min read · LW link

Washington, D.C.: Anxiety

RobinZ · 14 Jun 2018 1:17 UTC
15 points
0 comments · 1 min read · LW link

Anthropics made easy?

Stuart_Armstrong · 14 Jun 2018 0:56 UTC
32 points
61 comments · 6 min read · LW link

Counterfactual Mugging Poker Game

Scott Garrabrant · 13 Jun 2018 23:34 UTC
111 points
3 comments · 1 min read · LW link

On the Chatham House Rule

Scott Garrabrant · 13 Jun 2018 21:41 UTC
66 points
25 comments · 4 min read · LW link · 1 review

[Math] Towards Proof Writing as a Skill In Itself

Andrew Quinn · 13 Jun 2018 4:39 UTC
25 points
8 comments · 2 min read · LW link

Today a Tragedy

Logan Riggs · 13 Jun 2018 1:58 UTC
54 points
17 comments · 1 min read · LW link

Epistemological Braces

musicmage4114 · 12 Jun 2018 22:01 UTC
1 point
2 comments · 6 min read · LW link

LW Update 2018-06-11 – Vulcan Refactor, Karma Overhaul, Colored Links, Moderation Log

Raemon · 12 Jun 2018 0:49 UTC
32 points
34 comments · 3 min read · LW link

Admiring the Guts of Things.

Melkor · 11 Jun 2018 23:12 UTC
21 points
1 comment · 3 min read · LW link

A general model of safety-oriented AI development

Wei Dai · 11 Jun 2018 21:00 UTC
65 points
8 comments · 1 min read · LW link

Thoughts on the Inner Bruce

LeoHolman · 11 Jun 2018 20:18 UTC
12 points
2 comments · 3 min read · LW link

Announcing the second AI Safety Camp

Lachouette · 11 Jun 2018 18:59 UTC
34 points
0 comments · 1 min read · LW link

The Alignment Newsletter #10: 06/11/18

Rohin Shah · 11 Jun 2018 16:00 UTC
16 points
0 comments · 9 min read · LW link

Front Row Center

Zvi · 11 Jun 2018 13:50 UTC
30 points
12 comments · 2 min read · LW link
(thezvi.wordpress.com)

A Loophole for Self-Applicative Soundness

Diffractor · 11 Jun 2018 7:57 UTC
2 points
4 comments · 2 min read · LW link

AI and the paperclip problem (or: Economist solves control problem with one weird trick!)

fortyeridania · 11 Jun 2018 2:19 UTC
10 points
4 comments · 1 min read · LW link
(voxeu.org)

Oops on Commodity Prices

sarahconstantin · 10 Jun 2018 15:40 UTC
148 points
8 comments · 2 min read · LW link
(srconstantin.wordpress.com)

Resolving the Dr Evil Problem

Chris_Leong · 10 Jun 2018 11:56 UTC
10 points
8 comments · 3 min read · LW link

Simplified Poker Conclusions

Zvi · 9 Jun 2018 21:50 UTC
64 points
2 comments · 5 min read · LW link
(thezvi.wordpress.com)

Fundamentals of Formalisation Level 3: Set Theoretic Relations and Enumerability

philip_b · 9 Jun 2018 19:57 UTC
16 points
0 comments · 1 min read · LW link

Unraveling the Failure’s Try

LeoHolman · 9 Jun 2018 14:34 UTC
9 points
11 comments · 2 min read · LW link

Physics has laws, the Universe might not

shminux · 9 Jun 2018 5:33 UTC
25 points
23 comments · 3 min read · LW link

Could we send a message to the distant future?

paulfchristiano · 9 Jun 2018 4:27 UTC
37 points
23 comments · 3 min read · LW link

RFC: Meta-ethical uncertainty in AGI alignment

Gordon Seidoh Worley · 8 Jun 2018 20:56 UTC
16 points
6 comments · 3 min read · LW link

Describing LessWrong in one paragraph

ChristianKl · 8 Jun 2018 20:54 UTC
16 points
6 comments · 1 min read · LW link

Quantum AI Goal

Gurkenglas · 8 Jun 2018 16:55 UTC
−1 points
5 comments · 1 min read · LW link

Quantum AI Box

Gurkenglas · 8 Jun 2018 16:20 UTC
4 points
15 comments · 1 min read · LW link

Effective Altruism as Global Catastrophe Mitigation

Evan_Gaensbauer · 8 Jun 2018 4:17 UTC
9 points
0 comments · 22 min read · LW link

Poker example: (not) deducing someone’s preferences

Stuart_Armstrong · 8 Jun 2018 3:19 UTC
16 points
5 comments · 3 min read · LW link

The Incoherence of Honesty

Gordon Seidoh Worley · 8 Jun 2018 2:28 UTC
20 points
16 comments · 3 min read · LW link

Reflections on Berkeley REACH

stardust · 8 Jun 2018 0:02 UTC
123 points
9 comments · 14 min read · LW link

Beyond Astronomical Waste

Wei Dai · 7 Jun 2018 21:04 UTC
125 points
41 comments · 3 min read · LW link

The first AI Safety Camp & onwards

Remmelt · 7 Jun 2018 20:13 UTC
46 points
0 comments · 8 min read · LW link