(I have no idea what y’all are using KL-divergence for, so I have no opinion about whether you should have been using it in this theorem.)
(I have no idea what y’all are using KL-divergence for, so I have no opinion about whether you should have been using it in this theorem.)