RSS

evhub

Karma: 463 (LW), 128 (AF)
Page 1

Risks from Learned Op­ti­miza­tion: Con­clu­sion and Re­lated Work

evhub
7 Jun 2019 19:53 UTC
52 points
0 comments6 min readLW link

De­cep­tive Alignment

evhub
5 Jun 2019 20:16 UTC
55 points
4 comments17 min readLW link

The In­ner Align­ment Problem

evhub
4 Jun 2019 1:20 UTC
60 points
13 comments13 min readLW link

Con­di­tions for Mesa-Optimization

evhub
1 Jun 2019 20:52 UTC
48 points
27 comments12 min readLW link