I resonate with that description of IRL-at-oneself. I have very much been trying to follow the reasoning process that long term planning needs to inline short-term values, and it’s been slowly increasing in how much it pays off. I also like this model of human discounting irrationality more than the smooth hyperbolic discounting one—it feels to me like, reasoning only from architectural priors, we ought to expect more exponential discounting between heavily log-quantized time units, so that rewards that are within the ten second span all evaluate the same, the 10 minute span all evaluate the same, the 100 minute span all evaluate the same, etc.
I resonate with that description of IRL-at-oneself. I have very much been trying to follow the reasoning process that long term planning needs to inline short-term values, and it’s been slowly increasing in how much it pays off. I also like this model of human discounting irrationality more than the smooth hyperbolic discounting one—it feels to me like, reasoning only from architectural priors, we ought to expect more exponential discounting between heavily log-quantized time units, so that rewards that are within the ten second span all evaluate the same, the 10 minute span all evaluate the same, the 100 minute span all evaluate the same, etc.