RSS

RAISE

Karma: 99

AI Safety Pr­ereq­ui­sites Course: Ba­sic ab­stract rep­re­sen­ta­tions of computation

RAISE
13 Mar 2019 19:38 UTC
32 points
2 comments1 min readLW link

IRL 1/​8: In­verse Re­in­force­ment Learn­ing and the prob­lem of degeneracy

RAISE
4 Mar 2019 13:11 UTC
23 points
2 comments1 min readLW link
(app.grasple.com)

IRL 2/​8: Miti­gat­ing de­gen­er­acy: mul­ti­ple experimentation

RAISE
11 Mar 2019 13:33 UTC
14 points
0 comments1 min readLW link
(app.grasple.com)

IDA 1-4/​14: Prob­lem Statement

RAISE
7 Mar 2019 14:02 UTC
9 points
0 comments1 min readLW link
(app.grasple.com)

IRL 5/​8: Max­i­mum Causal En­tropy IRL

RAISE
4 Apr 2019 10:53 UTC
8 points
2 comments1 min readLW link
(app.grasple.com)

IRL 7/​8: Gen­er­al­iz­ing hu­man-robot co­op­er­a­tion: Co­op­er­a­tive IRL

RAISE
15 Apr 2019 10:13 UTC
7 points
0 comments1 min readLW link
(app.grasple.com)

[Link] IDA: 11-14/​14: Fu­ture Directions

RAISE
28 Mar 2019 18:56 UTC
6 points
0 comments1 min readLW link

IRL 3/​8: Miti­gat­ing de­gen­er­acy: fea­ture matching

RAISE
18 Mar 2019 20:15 UTC
6 points
0 comments1 min readLW link
(app.grasple.com)

IRL 6/​8: Query­ing the hu­man: Ac­tive Re­ward Learning

RAISE
8 Apr 2019 13:49 UTC
4 points
0 comments1 min readLW link
(app.grasple.com)

IRL 4/​8: Max­i­mum En­tropy IRL and Bayesian IRL

RAISE
25 Mar 2019 22:07 UTC
4 points
0 comments1 min readLW link
(app.grasple.com)

[Link] IDA 9/​14: The Scheme

RAISE
21 Mar 2019 18:28 UTC
4 points
0 comments1 min readLW link

IDA 5-8/​14: Ap­proval Directed Agents

RAISE
14 Mar 2019 23:58 UTC
4 points
0 comments1 min readLW link
(app.grasple.com)

IRL 8/​8: Gen­er­a­tive Ad­ver­sar­ial Imi­ta­tion Learning

RAISE
22 Apr 2019 15:02 UTC
2 points
0 comments1 min readLW link
(app.grasple.com)