Example 4: “I will lose more of the me and more of the you”
I’ve never seen GPT-3 or ChatGPT put “the” before pronouns. It looks to me like they didn’t even train it well enough for it to learn that you don’t put “the” before pronouns. I think extreme repetitiveness points in a similar direction.
English is not my native tongue so please feel free to discount my opinion accordingly, but I feel like this usage is not unfamiliar to me, mostly from a psychology-adjacent context. I cannot readily find any references to where I’ve encountered it, but there is this: The Me and the Not-Me: Positive and Negative Poles of Identity (DOI 10.1007/978-1-4419-9188-1_2).
Also, google “the me” “the you” reports “About 65,500,000 results” (though they seem to consist mostly of lyrics).
Yeah, but a conversation with a chatbot is not a psychology-adjacent context (or song lyrics), so if the model learned to put “the” before “me” and “you” from this kind of data, then inserting that into a conversation is still evidence that it was badly trained and/or fine-tuned.
Example 4: “I will lose more of the me and more of the you”
I’ve never seen GPT-3 or ChatGPT put “the” before pronouns. It looks to me like they didn’t even train it well enough for it to learn that you don’t put “the” before pronouns. I think extreme repetitiveness points in a similar direction.
English is not my native tongue so please feel free to discount my opinion accordingly, but I feel like this usage is not unfamiliar to me, mostly from a psychology-adjacent context. I cannot readily find any references to where I’ve encountered it, but there is this: The Me and the Not-Me: Positive and Negative Poles of Identity (DOI 10.1007/978-1-4419-9188-1_2).
Also, google “the me” “the you” reports “About 65,500,000 results” (though they seem to consist mostly of lyrics).
(I’m not a native English speaker either)
Yeah, but a conversation with a chatbot is not a psychology-adjacent context (or song lyrics), so if the model learned to put “the” before “me” and “you” from this kind of data, then inserting that into a conversation is still evidence that it was badly trained and/or fine-tuned.