Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
michaelcohen comments on
Delegative Reinforcement Learning with a Merely Sane Advisor
michaelcohen
22 Apr 2019 0:50 UTC
LW: 1 AF: 1
AF
(as opposed to standard regret bounds in RL which are only applicable in the e
pisodic
setting)
??
Back to top
??