ESRogs comments on Prize for probable problems

ESRogs 20 Mar 2018 18:50 UTC
2 points
and the runtime cost is as good as the ML algorithm you’re using to distill new agents
Why would the runtime cost be on par with the distillation cost?
- William_S 20 Mar 2018 19:20 UTC
  3 points
  Parent
  Sorry, that was a bit confusing, edited to clarify. What I mean is, you have some algorithm you’re using to implement new agents, and that algorithm has a training cost (that you pay during distillation) and a runtime cost (that you pay when you apply the agent). The runtime cost of the distilled agent can be as good as the runtime cost of an unaligned agent implemented by the same algorithm (part of Paul’s claim about being competitive with unaligned agents).