gwern comments on LLMs, Batches, and Emergent Episodic Memory