What distinction is being drawn in the wiki article between percept sequences and outcomes? The agent’s perceptions are its only clue to the outcomes it has achieved, so a utility function over outcomes reduces, via the agent’s posterior distribution of outcomes given perceptions, to one over perceptions.
ETA: I’m itching to add a {{by whom}} tag after “sometimes” and {{citation needed}} after “misinterpreted”, but I don’t think the LW wiki supports those. The implication of the sentence is that some people have interpreted the paper that way and at least one person has argued that this is incorrect, but who and where?
What distinction is being drawn in the wiki article between percept sequences and outcomes? The agent’s perceptions are its only clue to the outcomes it has achieved, so a utility function over outcomes reduces, via the agent’s posterior distribution of outcomes given perceptions, to one over perceptions.
ETA: I’m itching to add a {{by whom}} tag after “sometimes” and {{citation needed}} after “misinterpreted”, but I don’t think the LW wiki supports those. The implication of the sentence is that some people have interpreted the paper that way and at least one person has argued that this is incorrect, but who and where?