Vika

Karma: 1,805 (LW), 73 (AF)

Designing agent incentives to avoid side effects

Vika
11 Mar 2019 20:55 UTC
31 points
0 comments · 2 min read · LW link
(medium.com)

New safety research agenda: scalable agent alignment via reward modeling

Vika
20 Nov 2018 17:29 UTC
35 points
12 comments · 1 min read · LW link
(medium.com)

Discussion on the machine learning approach to AI safety

Vika
1 Nov 2018 20:54 UTC
26 points
3 comments · 4 min read · LW link

New DeepMind AI Safety Research Blog

Vika
27 Sep 2018 16:28 UTC
46 points
0 comments · 1 min read · LW link
(medium.com)