What can the prin­ci­pal-agent liter­a­ture tell us about AI risk?

apc8 Feb 2020 21:28 UTC
104 points
29 comments16 min readLW link

A Cau­tion­ary Note on Un­lock­ing the Emo­tional Brain

eapache8 Feb 2020 17:21 UTC
52 points
20 comments2 min readLW link

[Question] What is this re­view fea­ture?

Long try8 Feb 2020 15:30 UTC
1 point
1 comment1 min readLW link

Hal­i­fax SSC Meetup—FEB 8

interstice8 Feb 2020 0:45 UTC
4 points
0 comments1 min readLW link

On the falsifi­a­bil­ity of hypercomputation

jessicata7 Feb 2020 8:16 UTC
24 points
4 comments4 min readLW link
(unstableontology.com)

More write­ups!

jefftk7 Feb 2020 3:10 UTC
40 points
5 comments1 min readLW link
(www.jefftk.com)

Book Re­view: De­ci­sive by Chip and Dan Heath

Ian David Moss6 Feb 2020 20:15 UTC
4 points
0 comments2 min readLW link
(medium.com)

Bayes-Up: An App for Shar­ing Bayesian-MCQ

Louis Faucon6 Feb 2020 19:01 UTC
53 points
9 comments1 min readLW link

Mazes Se­quence Roundup: Fi­nal Thoughts and Paths Forward

Zvi6 Feb 2020 16:10 UTC
85 points
27 comments14 min readLW link1 review
(thezvi.wordpress.com)

Plau­si­bly, al­most ev­ery pow­er­ful al­gorithm would be manipulative

Stuart_Armstrong6 Feb 2020 11:50 UTC
38 points
25 comments3 min readLW link

Some quick notes on hand hygiene

willbradshaw6 Feb 2020 2:47 UTC
68 points
52 comments3 min readLW link

Po­ten­tial Re­search Topic: Vingean Reflec­tion, Value Align­ment and Aspiration

Vaughn Papenhausen6 Feb 2020 1:09 UTC
15 points
4 comments4 min readLW link

Syn­the­siz­ing am­plifi­ca­tion and debate

evhub5 Feb 2020 22:53 UTC
33 points
10 comments4 min readLW link

Wri­teup: Progress on AI Safety via Debate

5 Feb 2020 21:04 UTC
100 points
18 comments33 min readLW link

[AN #85]: The nor­ma­tive ques­tions we should be ask­ing for AI al­ign­ment, and a sur­pris­ingly good chatbot

Rohin Shah5 Feb 2020 18:20 UTC
14 points
2 comments7 min readLW link
(mailchi.mp)

The Ad­ven­ture: a new Utopia story

Stuart_Armstrong5 Feb 2020 16:50 UTC
98 points
37 comments51 min readLW link

“But that’s your job”: why or­gani­sa­tions can work

Stuart_Armstrong5 Feb 2020 12:25 UTC
76 points
12 comments4 min readLW link

Train­ing a tiny SupAmp model on easy tasks. The in­fluence of failure rate on learn­ing curves

rmoehn5 Feb 2020 7:22 UTC
5 points
0 comments1 min readLW link

Phys­i­cal al­ign­ment—do you have it? Take a minute & check.

leggi5 Feb 2020 4:02 UTC
4 points
4 comments1 min readLW link

Open & Wel­come Thread—Fe­bru­ary 2020

ryan_b4 Feb 2020 20:49 UTC
17 points
114 comments1 min readLW link

Meta-Prefer­ence Utilitarianism

B Jacobs4 Feb 2020 20:24 UTC
10 points
30 comments1 min readLW link

Philo­soph­i­cal self-ratification

jessicata3 Feb 2020 22:48 UTC
23 points
13 comments5 min readLW link
(unstableontology.com)

Twenty-three AI al­ign­ment re­search pro­ject definitions

rmoehn3 Feb 2020 22:21 UTC
23 points
0 comments6 min readLW link

Ab­sent co­or­di­na­tion, fu­ture tech­nol­ogy will cause hu­man extinction

Jeffrey Ladish3 Feb 2020 21:52 UTC
21 points
12 comments5 min readLW link

Long Now, and Cul­ture vs Artifacts

Raemon3 Feb 2020 21:49 UTC
26 points
3 comments6 min readLW link

[Question] Look­ing for books about soft­ware en­g­ineer­ing as a field

mingyuan3 Feb 2020 21:49 UTC
14 points
15 comments1 min readLW link

Cat­e­gory The­ory Without The Baggage

johnswentworth3 Feb 2020 20:03 UTC
136 points
49 comments13 min readLW link

Pro­tect­ing Large Pro­jects Against Mazedom

Zvi3 Feb 2020 17:10 UTC
76 points
11 comments4 min readLW link1 review
(thezvi.wordpress.com)

Pes­simism About Un­known Un­knowns In­spires Conservatism

michaelcohen3 Feb 2020 14:48 UTC
31 points
2 comments5 min readLW link

Map Of Effec­tive Altruism

Scott Alexander3 Feb 2020 6:20 UTC
17 points
1 comment1 min readLW link
(slatestarcodex.com)

UML IX: Ker­nels and Boosting

Rafael Harth2 Feb 2020 21:51 UTC
13 points
1 comment10 min readLW link

A point of clar­ifi­ca­tion on in­fo­haz­ard terminology

eukaryote2 Feb 2020 17:43 UTC
50 points
21 comments2 min readLW link
(eukaryotewritesblog.com)

[Question] Money isn’t real. When you donate money to a char­ity, how does it ac­tu­ally help?

Dagon2 Feb 2020 17:03 UTC
15 points
28 comments1 min readLW link

[Link] Beyond the hill: thoughts on on­tolo­gies for think­ing, es­say-com­plete­ness and fore­cast­ing

jacobjacob2 Feb 2020 12:39 UTC
33 points
6 comments1 min readLW link

The Case for Ar­tifi­cial Ex­pert In­tel­li­gence (AXI): What lies be­tween nar­row and gen­eral AI?

Yuli_Ban2 Feb 2020 5:55 UTC
8 points
2 comments6 min readLW link

“Me­mento Mori”, Said The Confessor

namespace2 Feb 2020 3:37 UTC
34 points
4 comments1 min readLW link
(www.thelastrationalist.com)

Bay Win­ter Sols­tice seat­ing-scarcity

Raemon1 Feb 2020 23:09 UTC
2 points
3 comments2 min readLW link

The case for lifel­og­ging as life extension

Matthew Barnett1 Feb 2020 21:56 UTC
48 points
17 comments3 min readLW link1 review

What Money Can­not Buy

johnswentworth1 Feb 2020 20:11 UTC
321 points
49 comments4 min readLW link1 review

Effec­tive Altru­ism QALY work­shop ma­te­ri­als & out­line (and Jan 13 ’19 meetup notes)

samstowers1 Feb 2020 4:42 UTC
10 points
1 comment3 min readLW link

More Rhythm Options

jefftk1 Feb 2020 3:10 UTC
1 point
0 comments1 min readLW link
(www.jefftk.com)

[Question] In­stru­men­tal Oc­cam?

abramdemski31 Jan 2020 19:27 UTC
30 points
15 comments1 min readLW link

REVISED: A drown­ing child is hard to find

Benquo31 Jan 2020 18:07 UTC
22 points
35 comments1 min readLW link
(benjaminrosshoffman.com)

Jan­uary 2020 gw­ern.net newsletter

gwern31 Jan 2020 18:04 UTC
19 points
0 comments1 min readLW link
(www.gwern.net)

Create a Full Alter­na­tive Stack

Zvi31 Jan 2020 17:10 UTC
79 points
14 comments6 min readLW link1 review
(thezvi.wordpress.com)

[Link] Ig­no­rance, a skil­led practice

romeostevensit31 Jan 2020 16:21 UTC
16 points
9 comments2 min readLW link

[ELDR Tac­tics] Con­sider switch­ing to (mostly) de­caf.

aaq31 Jan 2020 15:09 UTC
29 points
2 comments4 min readLW link

[Question] Ex­ist­ing work on cre­at­ing ter­minol­ogy & names?

ozziegooen31 Jan 2020 12:16 UTC
10 points
6 comments1 min readLW link

Book Re­view: Hu­man Compatible

Scott Alexander31 Jan 2020 5:20 UTC
78 points
6 comments16 min readLW link
(slatestarcodex.com)

HALIFAX SSC MEETUP—FEB. 1

interstice31 Jan 2020 3:59 UTC
4 points
0 comments1 min readLW link