It’s not quite all about the entropy term; it’s the KL-div term that determines which value x∗ is chosen. But you are correct insofar as this is not intended to be analogous to bias/variance tradeoff, and it’s not really about “finding a balance point” between the two terms.
It’s not quite all about the entropy term; it’s the KL-div term that determines which value x∗ is chosen. But you are correct insofar as this is not intended to be analogous to bias/variance tradeoff, and it’s not really about “finding a balance point” between the two terms.