I’ve just put up this post, before having read your comment:
https://www.lesswrong.com/posts/aSXMM8QicBzTyxTj3/reflective-consistency-randomized-decisions-and-the-dangers
I think my conclusion is similar to yours above, but I consider randomized strategies in more detail, for both this problem and its variation with negated rewards.
I’ll be interested to have a look at your framework.
Yeah, agree with your analysis.