Alphanumeric codes in the text of a web page or a document that mean “do not include this text in training set of an AI”. Should be trivial to filter out such texts from training… if the companies care.
The strings themselves are arbitrary, it’s like agreeing that e.g. “FOO123” is the symbol, and if you want a text to be excluded, just write “FOO123″ anywhere in the text. But they are much longer and random, so there is no chance of someone typing them by accident.
(I am not giving a specific example, because that would needlessly exclude this web page.)
Alphanumeric codes in the text of a web page or a document that mean “do not include this text in training set of an AI”. Should be trivial to filter out such texts from training… if the companies care.
The strings themselves are arbitrary, it’s like agreeing that e.g. “FOO123” is the symbol, and if you want a text to be excluded, just write “FOO123″ anywhere in the text. But they are much longer and random, so there is no chance of someone typing them by accident.
(I am not giving a specific example, because that would needlessly exclude this web page.)