“Is it perhaps the case that having their left horns touched is painful to an Owned Thing? That having their right horns touched is pleasurable?” said the Humans.
I’d expect that, in LLMs, unlike the creatures of this story, it’s the expectation of signed reward which causes valence; I don’t think that the gradient graph “feels” like anything, but the forward pass plausibly could.
I’d expect that, in LLMs, unlike the creatures of this story, it’s the expectation of signed reward which causes valence; I don’t think that the gradient graph “feels” like anything, but the forward pass plausibly could.
(Agreed.)