Rohin Shah comments on Full toy model for preference learning

Rohin Shah 11 Nov 2019 5:44 UTC
Planned summary:
This post applies Stuart’s general preference learning algorithm to a toy environment in which a robot has a mishmash of preferences about how to classify and bin two types of objects.
Planned opinion:
This is a nice illustration of the very abstract algorithm proposed before; I’d love it if more people illustrated their algorithms this way.