Page 1

Align­ment Re­search Field Guide

abramdemski
8 Mar 2019 19:57 UTC
180 points
4 comments17 min readLW link

Per­son­al­ized Medicine For Real

sarahconstantin
4 Mar 2019 22:40 UTC
177 points
12 comments5 min readLW link

“Other peo­ple are wrong” vs “I am right”

Buck
22 Feb 2019 20:01 UTC
163 points
11 comments9 min readLW link

Build­ing up to an In­ter­nal Fam­ily Sys­tems model

Kaj_Sotala
26 Jan 2019 12:25 UTC
138 points
27 comments28 min readLW link

More re­al­is­tic tales of doom

paulfchristiano
17 Mar 2019 20:18 UTC
135 points
18 comments8 min readLW link

The 3 Books Tech­nique for Learn­ing a New Skilll

mr-hire
9 Jan 2019 12:45 UTC
125 points
23 commentsLW link

Disen­tan­gling ar­gu­ments for the im­por­tance of AI safety

ricraz
21 Jan 2019 12:41 UTC
112 points
21 comments8 min readLW link

Thoughts on Hu­man Models

xrchz
21 Feb 2019 9:10 UTC
108 points
17 comments10 min readLW link

Some Thoughts on My Psy­chi­a­try Practice

Laura B
16 Jan 2019 23:16 UTC
106 points
31 commentsLW link

Hu­mans Who Are Not Con­cen­trat­ing Are Not Gen­eral Intelligences

sarahconstantin
25 Feb 2019 20:40 UTC
98 points
13 comments6 min readLW link

Rule Thinkers In, Not Out

Scott Alexander
27 Feb 2019 2:40 UTC
98 points
41 comments4 min readLW link

Karma-Change Notifications

jimrandomh
2 Mar 2019 2:52 UTC
91 points
42 comments1 min readLW link

Refram­ing Su­per­in­tel­li­gence: Com­pre­hen­sive AI Ser­vices as Gen­eral Intelligence

rohinmshah
8 Jan 2019 7:12 UTC
90 points
67 comments5 min readLW link
(www.fhi.ox.ac.uk)

Rest Days vs Re­cov­ery Days

Unreal
19 Mar 2019 22:37 UTC
90 points
15 comments6 min readLW link

Prob­a­bil­ity space has 2 metrics

Donald Hobson
10 Feb 2019 0:28 UTC
88 points
11 commentsLW link

S-Curves for Trend Forecasting

mr-hire
23 Jan 2019 18:17 UTC
87 points
7 commentsLW link

Book Sum­mary: Con­scious­ness and the Brain

Kaj_Sotala
16 Jan 2019 14:43 UTC
86 points
13 commentsLW link

RAISE is launch­ing their MVP

toonalfrink
26 Feb 2019 11:45 UTC
85 points
1 comment1 min readLW link

Less Com­pe­ti­tion, More Mer­i­toc­racy?

Zvi
20 Jan 2019 2:00 UTC
81 points
13 commentsLW link

An­nounce­ment: AI al­ign­ment prize round 4 winners

cousin_it
20 Jan 2019 14:46 UTC
80 points
37 commentsLW link

Se­quence in­tro­duc­tion: non-agent and mul­ti­a­gent mod­els of mind

Kaj_Sotala
7 Jan 2019 14:12 UTC
79 points
4 commentsLW link

From Per­sonal to Pri­son Gangs: En­forc­ing Proso­cial Behavior

johnswentworth
24 Jan 2019 18:07 UTC
78 points
8 commentsLW link

Privacy

Zvi
15 Mar 2019 20:20 UTC
76 points
72 comments6 min readLW link

Book Re­view: The Struc­ture Of Scien­tific Revolutions

Scott Alexander
9 Jan 2019 7:10 UTC
75 points
24 commentsLW link

[Question] Why is so much dis­cus­sion hap­pen­ing in pri­vate Google Docs?

Wei_Dai
12 Jan 2019 2:19 UTC
74 points
21 commentsLW link

Strat­egy is the De­con­fu­sion of Action

ryan_b
2 Jan 2019 20:56 UTC
73 points
4 commentsLW link

Three ways that “Suffi­ciently op­ti­mized agents ap­pear co­her­ent” can be false

Wei_Dai
5 Mar 2019 21:52 UTC
68 points
2 comments3 min readLW link

Blackmail

Zvi
19 Feb 2019 3:50 UTC
67 points
44 comments15 min readLW link

Ac­tive Cu­ri­os­ity vs Open Curiosity

Unreal
15 Mar 2019 16:54 UTC
67 points
18 comments3 min readLW link

[Question] How does Gra­di­ent Des­cent In­ter­act with Good­hart?

Scott Garrabrant
2 Feb 2019 0:14 UTC
66 points
15 commentsLW link

Epistemic Tenure

Scott Garrabrant
18 Feb 2019 22:56 UTC
66 points
27 comments3 min readLW link

Pavlov Generalizes

abramdemski
20 Feb 2019 9:03 UTC
66 points
2 comments7 min readLW link

In My Culture

Duncan_Sabien
7 Mar 2019 7:22 UTC
65 points
49 comments1 min readLW link
(medium.com)

[Question] Does anti-malaria char­ity de­stroy the lo­cal anti-malaria in­dus­try?

Viliam
5 Jan 2019 19:04 UTC
64 points
16 commentsLW link

Com­ments on CAIS

ricraz
12 Jan 2019 15:20 UTC
63 points
12 commentsLW link

The Case for a Big­ger Audience

John_Maxwell_IV
9 Feb 2019 7:22 UTC
63 points
58 commentsLW link

“AlphaS­tar: Mas­ter­ing the Real-Time Strat­egy Game StarCraft II”, Deep­Mind [won 10 of 11 games against hu­man pros]

gwern
24 Jan 2019 20:49 UTC
62 points
52 commentsLW link
(deepmind.com)

How the MtG Color Wheel Ex­plains AI Safety

Scott Garrabrant
15 Feb 2019 23:42 UTC
62 points
4 comments6 min readLW link

Policy-Based vs Willpower-Based Intentions

Unreal
28 Feb 2019 5:17 UTC
61 points
14 comments4 min readLW link

[Question] What are the open prob­lems in Hu­man Ra­tion­al­ity?

Raemon
13 Jan 2019 4:46 UTC
60 points
43 commentsLW link

Learn­ing-In­ten­tions vs Do­ing-In­ten­tions

Ruby
1 Jan 2019 22:22 UTC
58 points
14 commentsLW link

The Re­la­tion­ship Between Hier­ar­chy and Wealth

sarahconstantin
23 Jan 2019 2:00 UTC
58 points
8 commentsLW link

Subagents, in­tro­spec­tive aware­ness, and blending

Kaj_Sotala
2 Mar 2019 12:53 UTC
58 points
16 comments9 min readLW link

Two More De­ci­sion The­ory Prob­lems for Humans

Wei_Dai
4 Jan 2019 9:00 UTC
57 points
12 comments2 min readLW link

Me­gapro­ject management

ryan_b
11 Jan 2019 17:08 UTC
57 points
8 commentsLW link

When to use quantilization

RyanCarey
5 Feb 2019 17:17 UTC
56 points
5 commentsLW link

Plans are Re­cur­sive & Why This is Important

Ruby
10 Mar 2019 1:58 UTC
56 points
8 comments11 min readLW link

Com­bat vs Nur­ture & Meta-Contrarianism

abramdemski
10 Jan 2019 23:17 UTC
55 points
7 commentsLW link

Com­par­i­son of de­ci­sion the­o­ries (with a fo­cus on log­i­cal-coun­ter­fac­tual de­ci­sion the­o­ries)

riceissa
16 Mar 2019 21:15 UTC
55 points
9 comments10 min readLW link

Some Thoughts on Metaphilosophy

Wei_Dai
10 Feb 2019 0:28 UTC
54 points
24 comments4 min readLW link