RSS

In­verse Re­in­force­ment Learning

TagLast edit: 15 Apr 2021 17:44 UTC by [Error communicating with LW2 server]

Model Mis-speci­fi­ca­tion and In­verse Re­in­force­ment Learning

9 Nov 2018 15:33 UTC
30 points
3 comments16 min readLW link

Thoughts on “Hu­man-Com­pat­i­ble”

TurnTrout10 Oct 2019 5:24 UTC
58 points
35 comments5 min readLW link

Learn­ing bi­ases and re­wards simultaneously

rohinmshah6 Jul 2019 1:45 UTC
41 points
3 comments4 min readLW link

Our take on CHAI’s re­search agenda in un­der 1500 words

alexflint17 Jun 2020 12:24 UTC
95 points
19 comments5 min readLW link

Prob­lems in­te­grat­ing de­ci­sion the­ory and in­verse re­in­force­ment learning

agilecaveman8 May 2018 5:11 UTC
7 points
2 comments3 min readLW link

IRL 1/​8: In­verse Re­in­force­ment Learn­ing and the prob­lem of degeneracy

RAISE4 Mar 2019 13:11 UTC
20 points
2 comments1 min readLW link
(app.grasple.com)

Del­ega­tive In­verse Re­in­force­ment Learning

Vanessa Kosoy12 Jul 2017 12:18 UTC
15 points
0 comments16 min readLW link

[Question] Can co­her­ent ex­trap­o­lated vo­li­tion be es­ti­mated with In­verse Re­in­force­ment Learn­ing?

Jade Bishop15 Apr 2019 3:23 UTC
12 points
5 comments3 min readLW link

Co­op­er­a­tive In­verse Re­in­force­ment Learn­ing vs. Ir­ra­tional Hu­man Preferences

orthonormal18 Jun 2016 0:55 UTC
3 points
0 comments3 min readLW link

In­verse re­in­force­ment learn­ing on self, pre-on­tol­ogy-change

Stuart_Armstrong18 Nov 2015 13:23 UTC
0 points
0 comments1 min readLW link

Bi­ased re­ward-learn­ing in CIRL

Stuart_Armstrong5 Jan 2018 18:12 UTC
7 points
1 comment7 min readLW link

CIRL Wireheading

tom4everitt8 Aug 2017 6:33 UTC
3 points
0 comments2 min readLW link

(C)IRL is not solely a learn­ing process

Stuart_Armstrong15 Sep 2016 8:35 UTC
0 points
0 comments3 min readLW link

Book Re­view: Hu­man Compatible

Scott Alexander31 Jan 2020 5:20 UTC
75 points
6 comments16 min readLW link
(slatestarcodex.com)

Book re­view: Hu­man Compatible

PeterMcCluskey19 Jan 2020 3:32 UTC
37 points
2 comments5 min readLW link
(www.bayesianinvestor.com)

AXRP Epi­sode 2 - Learn­ing Hu­man Bi­ases with Ro­hin Shah

DanielFilan29 Dec 2020 20:43 UTC
11 points
0 comments35 min readLW link

My take on Michael Littman on “The HCI of HAI”

alexflint2 Apr 2021 19:51 UTC
56 points
4 comments7 min readLW link

RAISE is launch­ing their MVP

toonalfrink26 Feb 2019 11:45 UTC
67 points
1 comment1 min readLW link

Hu­man-AI Collaboration

rohinmshah22 Oct 2019 6:32 UTC
35 points
7 comments2 min readLW link
(bair.berkeley.edu)

Agents That Learn From Hu­man Be­hav­ior Can’t Learn Hu­man Values That Hu­mans Haven’t Learned Yet

steven046111 Jul 2018 2:59 UTC
27 points
11 comments1 min readLW link

Hu­mans can be as­signed any val­ues what­so­ever...

Stuart_Armstrong13 Oct 2017 11:29 UTC
13 points
6 comments4 min readLW link
No comments.