Rohin Shah comments on [AN #100]: What might go wrong if you learn a reward function while acting

Rohin Shah 21 May 2020 22:06 UTC
LW: 2 AF: 2
Thank you for reading closely enough to notice the 5 characters used to mark the occasion :)