Why does the infinite limit of value learning matter if we’re doing soft optimization against a fixed utility distribution?
Sorry, I didn’t realize this and I was responding independently to Charlie Steiner.