DragonGod comments on Beren’s “Deconfusing Direct vs Amortised Optimisation”

DragonGod 5 Apr 2023 19:37 UTC
5 points
4
I think amortised optimisation doesn’t lie on the same spectrum as “quantiliser - (direct) optimiser” but is another dimension entirely. I.e. your question is like asking: “where between the x and y axis does the line for the z axis lie”?

Amortised optimisation is just a fundamentally different approach where we learn to approximate some function from a dataset and then just evaluate the learned function.

The behaviour of the amortised policy may look similar to a direct optimiser on the training distribution, but diverge arbitrarily far on another distribution where the correlation between the learned policy and a particular objective breaks down.