RSS

Vika

Karma: 1,773 (LW), 60 (AF)
AllPostsComments
NewTop
Page 1

De­sign­ing agent in­cen­tives to avoid side effects

Vika
11 Mar 2019 20:55 UTC
30 points
0 comments2 min readLW link
(medium.com)

New safety re­search agenda: scal­able agent al­ign­ment via re­ward modeling

Vika
20 Nov 2018 17:29 UTC
35 points
12 commentsLW link
(medium.com)

Dis­cus­sion on the ma­chine learn­ing ap­proach to AI safety

Vika
1 Nov 2018 20:54 UTC
26 points
3 commentsLW link

New Deep­Mind AI Safety Re­search Blog

Vika
27 Sep 2018 16:28 UTC
46 points
0 commentsLW link
(medium.com)

Speci­fi­ca­tion gam­ing ex­am­ples in AI

Vika
3 Apr 2018 12:30 UTC
74 points
3 commentsLW link