We’ll publish about a prototype version of this system soon, which will probably involve us settling on a name.
It’s not really about simulators; in practice you will probably use RL + imitation for the distillation step.