Wouldn’t this take an enormous amount of observation time to generate enough data to learn a human-imitating policy?
Yes, although we could start now.
Also, I just wanted to give the simplest possible proposal. More reasonably, data like this could probably be gathered in many ways.
Yes, although we could start now.
Also, I just wanted to give the simplest possible proposal. More reasonably, data like this could probably be gathered in many ways.