I’m interested in doing in-depth dialogues to find cruxes. Message me if you are interested in doing this.
I do alignment research, mostly stuff that is vaguely agent foundations. Currently doing independent alignment research on ontology identification. Formerly on Vivek’s team at MIRI.
Part 1 feels like magic. I don’t understand it at an intuitive level and so I’m kinda suspicious of it. It seems like such a powerful technique for working with KL divergences. I’ll spend some more time playing around with it. Everything else makes sense to me.
My question is how did you come up with this technique? Was “small KL inequalities can be equivalent to larger KL inequalities” a background fact that you knew beforehand? Or did you start by wanting to find a way to make the Hellinger distances work?