RSS
Page 1

Alignment Newsletter #28

rohinmshah
15 Oct 2018 21:20 UTC
11 points
0 comments8 min readLW link

Standard ML Oracles vs Counterfactual ones

Stuart_Armstrong
10 Oct 2018 20:01 UTC
15 points
3 comments6 min readLW link

A Rationality Condition for CDT Is That It Equal EDT (Part 2)

abramdemski
9 Oct 2018 5:41 UTC
17 points
0 comments7 min readLW link

Alignment Newsletter #27

rohinmshah
9 Oct 2018 1:10 UTC
16 points
0 comments9 min readLW link

A Rationality Condition for CDT Is That It Equal EDT (Part 1)

abramdemski
4 Oct 2018 4:32 UTC
21 points
14 comments9 min readLW link

The Rocket Alignment Problem

Eliezer_Yudkowsky
4 Oct 2018 0:38 UTC
123 points
26 comments15 min readLW link

Alignment Newsletter #26

rohinmshah
2 Oct 2018 16:10 UTC
14 points
0 comments7 min readLW link

EDT solves 5 and 10 with conditional oracles

jessica.liu.taylor
30 Sep 2018 7:57 UTC
60 points
7 comments13 min readLW link

New DeepMind AI Safety Research Blog

Vika
27 Sep 2018 16:28 UTC
46 points
0 comments1 min readLW link
(medium.com)

Asymptotic Decision Theory (Improved Writeup)

Diffractor
27 Sep 2018 5:17 UTC
27 points
10 comments13 min readLW link

Wireheading as a potential problem with the new impact measure

Stuart_Armstrong
25 Sep 2018 14:15 UTC
25 points
20 comments4 min readLW link

Alignment Newsletter #25

rohinmshah
24 Sep 2018 16:10 UTC
22 points
3 comments9 min readLW link

Bridging syntax and semantics with Quine’s Gavagai

Stuart_Armstrong
24 Sep 2018 14:39 UTC
20 points
2 comments2 min readLW link

Reflective AIXI and Anthropics

Diffractor
24 Sep 2018 2:15 UTC
19 points
13 comments11 min readLW link

In Logical Time, All Games are Iterated Games

abramdemski
20 Sep 2018 2:01 UTC
79 points
8 comments5 min readLW link

Bridging syntax and semantics, empirically

Stuart_Armstrong
19 Sep 2018 16:48 UTC
17 points
0 comments5 min readLW link

Web of connotations: Bleggs, Rubes, thermostats and beliefs

Stuart_Armstrong
19 Sep 2018 16:47 UTC
20 points
0 comments8 min readLW link

Towards a New Impact Measure

TurnTrout
18 Sep 2018 17:21 UTC
104 points
130 comments33 min readLW link

Alignment Newsletter #24

rohinmshah
17 Sep 2018 16:20 UTC
10 points
3 comments12 min readLW link

(A → B) → A

Scott Garrabrant
11 Sep 2018 22:38 UTC
42 points
9 comments4 min readLW link