I concede that RL is a prototypical example of an algorithmic learning problem. The exploration vs. exploitation trade-off is something that RL algorithms must address. It is fair, then, to say that we gain insight into the “trade-off” by recognizing how the algorithms “solve” it.
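To make “solve” a bit more concrete, here is a minimal sketch of one common strategy, epsilon-greedy, on a toy multi-armed bandit. It is only an illustration under stated assumptions: rewards are Bernoulli, and the names `epsilon_greedy`, `true_means`, `epsilon`, and `steps` are hypothetical choices of mine, not taken from any particular library. Other strategies (UCB, Thompson sampling, and so on) handle the same trade-off differently.

```python
import random


def epsilon_greedy(true_means, epsilon, steps, seed=0):
    """Run epsilon-greedy on a Bernoulli bandit (illustrative sketch).

    true_means: hypothetical per-arm success probabilities, hidden from the agent.
    Returns (estimates, counts): the agent's value estimates and pulls per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: 1 with probability true_means[arm], else 0.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = epsilon_greedy([0.2, 0.5, 0.8], epsilon=0.1, steps=1000)
    print("estimates:", estimates)
    print("pull counts:", counts)
```

The single parameter `epsilon` is where the trade-off lives in this sketch: with `epsilon=0` the agent only exploits and, here, never leaves the first arm; with `epsilon=1` it only explores and never uses what it has learned.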
It is also fair to say there is an abstract concept of a “trade-off” that is not itself algorithmic.