Also, if you define the state to be the entire history, you lose the ergodicity assumptions that are needed to prove that algorithms can learn well: a history only ever grows, so each such state is visited at most once and never recurs.
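To make the point about history-as-state concrete, here is a minimal sketch rather than anything from a particular library. It assumes a toy process that emits a uniformly random bit each step; the function name `visit_counts` and its parameters are hypothetical. It counts how often each state is visited when the state is the latest observation, and when the state is the whole history.

```python
import random
from collections import Counter


def visit_counts(steps=200, seed=0):
    """Count state visits under two state definitions (illustrative toy process)."""
    rng = random.Random(seed)
    obs = 0
    history = (obs,)
    raw_visits = Counter({obs: 1})          # state = latest observation
    history_visits = Counter({history: 1})  # state = entire history so far
    for _ in range(steps):
        obs = rng.choice((0, 1))            # assumed dynamics: a fair coin flip
        history = history + (obs,)          # the history strictly grows each step
        raw_visits[obs] += 1
        history_visits[history] += 1
    return raw_visits, history_visits


raw, hist = visit_counts()
print("distinct raw states:", len(raw), "max visits:", max(raw.values()))
print("distinct history states:", len(hist), "max visits:", max(hist.values()))
```

With the observation as the state, the same few states are revisited over and over, which is what recurrence-based arguments rely on. With the history as the state, every state has a different length, so none is ever seen twice and there is nothing to average over.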