From Algorithms to Live By, I vaguely recall the multi-armed bandit problem. Maybe that’s what you’re looking for? Or is that still too closely tied to the explore-exploit paradigm?
Right. The setup for my problem is the same as the 'Bernoulli bandit', but I only care about the information, not the reward. All I see on that page is about exploration-exploitation.
I got a good answer here: https://stats.stackexchange.com/q/579642/5751
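To make the "information, not reward" objective concrete, here is a minimal sketch of one pure-exploration heuristic for the Bernoulli bandit: keep a Beta posterior per arm and always pull the arm whose posterior variance is largest (i.e., the arm we are most uncertain about). This is an illustrative assumption of mine, not the method from the linked answer; the function names and the uncertainty criterion are my own.

```python
import random

def posterior_variance(a, b):
    """Variance of a Beta(a, b) posterior over an arm's success probability."""
    return a * b / ((a + b) ** 2 * (a + b + 1))

def pure_exploration(true_probs, n_pulls, seed=0):
    """Pull the arm with the largest posterior variance (most uncertain),
    ignoring reward entirely; return the Beta posterior parameters."""
    rng = random.Random(seed)
    # Start each arm at a uniform Beta(1, 1) prior.
    posteriors = [[1, 1] for _ in true_probs]
    for _ in range(n_pulls):
        arm = max(range(len(posteriors)),
                  key=lambda i: posterior_variance(*posteriors[i]))
        if rng.random() < true_probs[arm]:
            posteriors[arm][0] += 1  # observed a success
        else:
            posteriors[arm][1] += 1  # observed a failure
    return posteriors

posteriors = pure_exploration([0.2, 0.5, 0.9], n_pulls=300)
estimates = [a / (a + b) for a, b in posteriors]
```

Unlike an explore-exploit strategy, nothing here favors high-reward arms; pulls are spent wherever they shrink uncertainty the most, which is closer to the information-only objective described above.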