RSS
Page 1

COEDT Equil­ibria in Games

Diffractor
6 Dec 2018 18:00 UTC
8 points
0 comments3 min readLW link

Why we need a *the­ory* of hu­man values

Stuart_Armstrong
5 Dec 2018 16:00 UTC
48 points
1 comment4 min readLW link

Fac­tored Cognition

stuhlmueller
5 Dec 2018 1:01 UTC
28 points
3 comments17 min readLW link

Align­ment Newslet­ter #35

rohinmshah
4 Dec 2018 1:10 UTC
15 points
0 comments6 min readLW link

Co­her­ence ar­gu­ments do not im­ply goal-di­rected behavior

rohinmshah
3 Dec 2018 3:26 UTC
45 points
20 comments7 min readLW link

Benign model-free RL

paulfchristiano
2 Dec 2018 4:10 UTC
10 points
0 comments7 min readLW link

In­tu­itions about goal-di­rected behavior

rohinmshah
1 Dec 2018 4:25 UTC
28 points
10 comments6 min readLW link

Iter­ated Distil­la­tion and Amplification

Ajeya Cotra
30 Nov 2018 4:47 UTC
19 points
6 comments6 min readLW link

For­mal Open Prob­lem in De­ci­sion Theory

Scott Garrabrant
29 Nov 2018 3:25 UTC
29 points
5 comments4 min readLW link

Reflec­tive or­a­cles as a solu­tion to the con­verse Law­vere problem

SamEisenstat
29 Nov 2018 3:23 UTC
18 points
0 comments7 min readLW link

The Ubiquitous Con­verse Law­vere Problem

Scott Garrabrant
29 Nov 2018 3:16 UTC
18 points
0 comments2 min readLW link

Hyper­real Brouwer

Scott Garrabrant
29 Nov 2018 3:15 UTC
24 points
0 comments6 min readLW link

Or­a­cle In­duc­tion Proofs

Diffractor
28 Nov 2018 8:12 UTC
6 points
0 comments9 min readLW link

Bounded Or­a­cle Induction

Diffractor
28 Nov 2018 8:11 UTC
29 points
0 comments9 min readLW link

Corrigibility

paulfchristiano
27 Nov 2018 21:50 UTC
29 points
1 comment6 min readLW link

Align­ment Newslet­ter #34

rohinmshah
26 Nov 2018 23:10 UTC
26 points
0 comments10 min readLW link

Hu­mans Con­sult­ing HCH

paulfchristiano
25 Nov 2018 23:18 UTC
19 points
8 comments1 min readLW link

Ap­proval-di­rected bootstrapping

paulfchristiano
25 Nov 2018 23:18 UTC
18 points
0 comments1 min readLW link

Fixed Point Discussion

Scott Garrabrant
24 Nov 2018 20:53 UTC
30 points
1 comment4 min readLW link

Ap­proval-di­rected agents: details

paulfchristiano
23 Nov 2018 23:26 UTC
18 points
1 comment7 min readLW link