Gunnar_Zarncke comments on The Winding Path

Gunnar_Zarncke 24 Nov 2015 21:54 UTC
6 points
0
To answer your last question: The thoughts ‘beautifully presented’ and ‘aesthetic’ crossed my mind before reading your question.

Also: Another thought that crossed my mind was ‘does he want to describe that rationality is kind of like human Thompson Sampling’ (or more generally explore-exploit optimization)?
- OrphanWilde 25 Nov 2015 16:20 UTC
  4 points
  0
  Parent
  I have not heard of Thompson Sampling, or explore-exploit optimization. That it’s a named phenomenon independent of what I considered to be rationality itself may be an issue; that’s more or less explicitly my own strategy and regard for rationality, which means it may not be as generalizable as I anticipated, as I’m almost certainly engaging in typical mind fallacy without realizing it there.
  - IlyaShpitser 26 Nov 2015 18:21 UTC
    5 points
    0
    Parent
    The explore-exploit tradeoff is a fundamental thing in learning in complex environments (in AI this is studied in reinforcement learning). The way this often comes up for people is when ordering food (new restaurant / old favorite, favorite order / new order).
  - Gunnar_Zarncke 25 Nov 2015 19:50 UTC
    2 points
    0
    Parent
    explore-exploit is no human strategy but a mathematical modelling of a specific optimization. Just in case that hasn’t been clear. It is just that the specific type of rationality you described could be seen as analogous to that.