I think this is caused by mistaking a 3-place word for a 2-place word. “Behind” takes something like arguments ‘current state’, ‘value function’, ‘schedule distribution’. I think you’ve misplaced the schedule distribution that’s supposed to go here, and are using some silly replacement, because you forgot it was an argument that mattered.
I think this is caused by mistaking a 3-place word for a 2-place word. “Behind” takes something like arguments ‘current state’, ‘value function’, ‘schedule distribution’. I think you’ve misplaced the schedule distribution that’s supposed to go here, and are using some silly replacement, because you forgot it was an argument that mattered.