To the speed section, you might want to add examples of parallel learning. Parallelizing learning of robot arm manipulation, or parallel playing of Atari games, which are both (much) faster in terms of wallclock time and also can be more sample and resource efficient (A3C actually can be more sample-efficient than DQN with multiple independent agents, and it doesn’t need to waste a great deal of RAM and computation on the experience replay).
To the speed section, you might want to add examples of parallel learning. Parallelizing learning of robot arm manipulation, or parallel playing of Atari games, which are both (much) faster in terms of wallclock time and also can be more sample and resource efficient (A3C actually can be more sample-efficient than DQN with multiple independent agents, and it doesn’t need to waste a great deal of RAM and computation on the experience replay).