helpful, especially the part connecting the training loss to simply predicting the added noise. I’d previously seen diffusion described as “start from noise and denoise step by step,” but this makes the MSE objective feel much less mysterious.
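For anyone else reading, here is a minimal sketch of the noise-prediction objective mentioned above. It assumes the common DDPM-style forward process, x_t = sqrt(alpha_bar) * x0 + sqrt(1 - alpha_bar) * eps. The names `add_noise`, `noise_mse`, and `alpha_bar`, along with the toy values, are illustrative assumptions rather than anything from the original explanation, and the "predictors" are simple stand-ins for a trained network.

```python
import math
import random


def add_noise(x0, alpha_bar, rng):
    """Forward process: x_t = sqrt(alpha_bar) * x0 + sqrt(1 - alpha_bar) * eps."""
    eps = [rng.gauss(0.0, 1.0) for _ in x0]  # the noise the model must predict
    x_t = [
        math.sqrt(alpha_bar) * x + math.sqrt(1.0 - alpha_bar) * e
        for x, e in zip(x0, eps)
    ]
    return x_t, eps


def noise_mse(eps_true, eps_pred):
    """Mean squared error between the true and the predicted noise."""
    return sum((a - b) ** 2 for a, b in zip(eps_true, eps_pred)) / len(eps_true)


rng = random.Random(0)
x0 = [0.5, -1.0, 2.0]  # toy "clean" sample (made up)
alpha_bar = 0.7        # hypothetical cumulative noise-schedule value
x_t, eps = add_noise(x0, alpha_bar, rng)

# Stand-in for an untrained network: always predicts zero noise.
loss_zero = noise_mse(eps, [0.0 for _ in x_t])

# A perfect predictor would return eps exactly, so its loss is zero.
loss_perfect = noise_mse(eps, eps)
```

In real training, the prediction would come from a network that sees `x_t` and the timestep, and the loss would be averaged over many samples and noise levels. The target is still just `eps`, which is why the objective reduces to a plain MSE.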