Understanding Iterated Distillation and Amplification: Claims and Oversight

William_S
17 Apr 2018 22:36 UTC
34 points
30 comments
9 min read
LW link

Multi-winner Voting: a question of Alignment

Jameson Quinn
17 Apr 2018 18:51 UTC
42 points
19 comments
12 min read
LW link

Computational Morality (Part 1) - a Proposed Solution

David Cooper
17 Apr 2018 0:09 UTC
−5 points
47 comments
9 min read
LW link

Believable Promises

Douglas_Reay
16 Apr 2018 16:17 UTC
5 points
0 comments
5 min read
LW link

Good News for Immunostimulants

sarahconstantin
16 Apr 2018 16:10 UTC
26 points
9 comments
2 min read
LW link
(srconstantin.wordpress.com)

The Alignment Newsletter #2: 04/16/18

Rohin Shah
16 Apr 2018 16:00 UTC
8 points
0 comments
5 min read
LW link

[Preprint for commenting] Fighting Aging as an Effective Altruism Cause

avturchin
16 Apr 2018 13:55 UTC
9 points
5 comments
1 min read
LW link

Announcement: AI alignment prize round 2 winners and next round

cousin_it
16 Apr 2018 3:08 UTC
64 points
29 comments
2 min read
LW link

Remembering the passing of Kathy Forth.

Elo
16 Apr 2018 1:53 UTC
38 points
21 comments
1 min read
LW link

The Case Against Education

Zvi
15 Apr 2018 12:30 UTC
77 points
44 comments
7 min read
LW link
(thezvi.wordpress.com)

You Are Being Underpaid

RobertM
15 Apr 2018 6:28 UTC
29 points
16 comments
8 min read
LW link

Dire Bullshit

Alicorn
15 Apr 2018 4:59 UTC
98 points
5 comments
2 min read
LW link

Raven Paradox Revisited

Chris_Leong
15 Apr 2018 0:08 UTC
5 points
12 comments
2 min read
LW link

A Different Prisoner’s Dilemma

Serpent-Stare
14 Apr 2018 15:54 UTC
9 points
1 comment
5 min read
LW link

How Going Meta Can Level Up Your Career

Matt Goldenberg
14 Apr 2018 2:13 UTC
24 points
7 comments
7 min read
LW link

Timothy Chu Origins Chapter 1

alkjash
13 Apr 2018 18:40 UTC
19 points
2 comments
6 min read
LW link
(radimentary.wordpress.com)

5 general voting pathologies: lesser names of Moloch

Jameson Quinn
13 Apr 2018 18:38 UTC
72 points
16 comments
10 min read
LW link

Death in Groups II

ryan_b
13 Apr 2018 18:12 UTC
14 points
4 comments
6 min read
LW link

Implicit extortion

paulfchristiano
13 Apr 2018 16:33 UTC
29 points
16 comments
6 min read
LW link
(ai-alignment.com)

Utility versus Reward function: partial equivalence

Stuart_Armstrong
13 Apr 2018 14:58 UTC
18 points
5 comments
5 min read
LW link

Recommendations vs. Guidelines

Scott Alexander
13 Apr 2018 4:10 UTC
72 points
18 comments
5 min read
LW link
(slatestarcodex.com)

Have you considered either a Kickstarter or a Patreon?

Chris_Leong
13 Apr 2018 1:09 UTC
7 points
0 comments
1 min read
LW link

Idea: OpenAI Gym environments where the AI is a part of the environment

philip_b
12 Apr 2018 22:28 UTC
4 points
5 comments
1 min read
LW link

Metamorphosis

Douglas_Reay
12 Apr 2018 21:53 UTC
2 points
0 comments
4 min read
LW link

LW Update 4/12/2018 – Frontpage Meetups

Raemon
12 Apr 2018 21:04 UTC
8 points
2 comments
1 min read
LW link

Rationality Vienna Meetup, April 2018

Viliam
12 Apr 2018 19:41 UTC
4 points
2 comments
1 min read
LW link

A voting theory primer for rationalists

Jameson Quinn
12 Apr 2018 15:15 UTC
229 points
98 comments
17 min read
LW link
2 reviews

Computing an exact quantilal policy

Vanessa Kosoy
12 Apr 2018 9:23 UTC
9 points
0 comments
2 min read
LW link

Quantilal control for finite MDPs

Vanessa Kosoy
12 Apr 2018 9:21 UTC
14 points
0 comments
13 min read
LW link

Washington, D.C.: Create & Complete

RobinZ
12 Apr 2018 3:45 UTC
4 points
0 comments
1 min read
LW link

I’m going to help you quit Facebook with some science

Elo
12 Apr 2018 3:09 UTC
30 points
11 comments
5 min read
LW link

Two clarifications about “Strategic Background”

Rob Bensinger
12 Apr 2018 2:11 UTC
47 points
6 comments
1 min read
LW link

The Chromatic Number of the Plane is at Least 5 - Aubrey de Grey

Scott Garrabrant
11 Apr 2018 18:19 UTC
61 points
5 comments
1 min read
LW link
(arxiv.org)

What I got out of ‘Algorithms to Live By’

rk
11 Apr 2018 16:35 UTC
26 points
1 comment
19 min read
LW link

Using rationality to debug Machine Learning

Dr_Manhattan
10 Apr 2018 20:03 UTC
20 points
3 comments
1 min read
LW link
(amid.fish)

The limits of corrigibility

Stuart_Armstrong
10 Apr 2018 10:49 UTC
27 points
9 comments
4 min read
LW link

Trustworthy Computing

Douglas_Reay
10 Apr 2018 7:55 UTC
9 points
1 comment
6 min read
LW link

Announcing the Alignment Newsletter

Rohin Shah
9 Apr 2018 21:16 UTC
29 points
3 comments
1 min read
LW link

OpenAI charter

wunan
9 Apr 2018 21:02 UTC
17 points
2 comments
1 min read
LW link
(blog.openai.com)

Role-playing game based on HPMOR

Alexander230
9 Apr 2018 18:19 UTC
3 points
0 comments
1 min read
LW link

The Alignment Newsletter #1: 04/09/18

Rohin Shah
9 Apr 2018 16:00 UTC
12 points
3 comments
4 min read
LW link

Trust the (local) expert

rk
9 Apr 2018 10:22 UTC
14 points
12 comments
7 min read
LW link

How do we change our minds? A meetup blueprint

ChristianKl
9 Apr 2018 7:58 UTC
10 points
0 comments
4 min read
LW link

Why focus on AI?

nomore
8 Apr 2018 22:58 UTC
0 points
6 comments
1 min read
LW link

Critique my Model: The EV of AGI to Selfish Individuals

ozziegooen
8 Apr 2018 20:04 UTC
19 points
9 comments
4 min read
LW link

BYOL (Buy Your Own Lunch)

JohnGreer
8 Apr 2018 19:32 UTC
1 point
8 comments
2 min read
LW link

I desire U, grpfrt, but I won’t eat U.

Jacob Falkovich
8 Apr 2018 19:19 UTC
6 points
1 comment
1 min read
LW link

GreaterWrong – more new features & enhancements

Said Achmiz
7 Apr 2018 20:41 UTC
9 points
1 comment
1 min read
LW link

Meaning and Moral Foundations Theory

bryjnar
7 Apr 2018 17:59 UTC
14 points
8 comments
2 min read
LW link

No Constant Distribution Can be a Logical Inductor

Diffractor
7 Apr 2018 9:09 UTC
16 points
1 comment
2 min read
LW link