Yes Requires the Possibility of No

Scott Garrabrant · 17 May 2019 22:39 UTC
261 points
55 comments · 2 min read · LW link · 2 reviews

The Parable of Predict-O-Matic

abramdemski · 15 Oct 2019 0:49 UTC
342 points
41 comments · 14 min read · LW link · 2 reviews

Humans Who Are Not Concentrating Are Not General Intelligences

sarahconstantin · 25 Feb 2019 20:40 UTC
186 points
35 comments · 6 min read · LW link · 1 review
(srconstantin.wordpress.com)

Understanding “Deep Double Descent”

evhub · 6 Dec 2019 0:00 UTC
149 points
51 comments · 5 min read · LW link · 4 reviews

Book Review: The Secret Of Our Success

Scott Alexander · 5 Jun 2019 6:50 UTC
158 points
19 comments · 25 min read · LW link · 2 reviews
(slatestarcodex.com)

Moloch Hasn’t Won

Zvi · 28 Dec 2019 16:30 UTC
179 points
40 comments · 7 min read · LW link · 1 review
(thezvi.wordpress.com)

The Power to Demolish Bad Arguments

Liron · 2 Sep 2019 12:57 UTC
97 points
83 comments · 11 min read · LW link · 6 reviews

Being the (Pareto) Best in the World

johnswentworth · 24 Jun 2019 18:36 UTC
405 points
57 comments · 3 min read · LW link · 3 reviews

Asymmetric Justice

Zvi · 25 Apr 2019 16:00 UTC
230 points
101 comments · 5 min read · LW link · 2 reviews
(thezvi.wordpress.com)

Noticing Frame Differences

Raemon · 30 Sep 2019 1:24 UTC
210 points
39 comments · 9 min read · LW link · 2 reviews

Excerpts from a larger discussion about simulacra

Benquo · 10 Apr 2019 21:27 UTC
53 points
40 comments · 6 min read · LW link · 5 reviews
(benjaminrosshoffman.com)

Coherent decisions imply consistent utilities

Eliezer Yudkowsky · 12 May 2019 21:33 UTC
148 points
81 comments · 26 min read · LW link · 3 reviews

Risks from Learned Optimization: Introduction

31 May 2019 23:44 UTC
184 points
42 comments · 12 min read · LW link · 3 reviews

Rule Thinkers In, Not Out

Scott Alexander · 27 Feb 2019 2:40 UTC
221 points
67 comments · 4 min read · LW link · 4 reviews
(slatestarcodex.com)

The Curse Of The Counterfactual

pjeby · 1 Nov 2019 18:34 UTC
124 points
34 comments · 19 min read · LW link · 1 review

From Personal to Prison Gangs: Enforcing Prosocial Behavior

johnswentworth · 24 Jan 2019 18:07 UTC
146 points
26 comments · 5 min read · LW link · 2 reviews

Integrity and accountability are core parts of rationality

habryka · 15 Jul 2019 20:22 UTC
153 points
66 comments · 6 min read · LW link · 1 review

Selection vs Control

abramdemski · 2 Jun 2019 7:01 UTC
168 points
25 comments · 11 min read · LW link · 2 reviews

Alignment Research Field Guide

abramdemski · 8 Mar 2019 19:57 UTC
264 points
9 comments · 17 min read · LW link · 2 reviews

Tal Yarkoni: No, it’s not The Incentives—it’s you

Zack_M_Davis · 11 Jun 2019 7:09 UTC
89 points
119 comments · 1 min read · LW link · 4 reviews
(www.talyarkoni.org)