MiguelDev comments on An examination of GPT-2′s boring yet effective glitch

MiguelDev 18 Apr 2024 9:14 UTC
1 point
0
I don’t think this phenomenon is just related to the training data alone because in RLLMv3, the ” Leilan” glitch mode persisted while ” petertodd” became entirely unrelated to bitcoin. It’s like some glitch tokens can be affected by the amount of re-training and some aren’t. I believe that there is something much deeper is happening here, an architectural flaw that might be related to the token selection/construction process.