But it’d be incredibly easy to train CLIP in such a way that it has to tell the difference between similar images much more often
I don’t think that’s true.
(I’m choosing not to respond to this comment for reasons relating to potential info-hazards)
I don’t think that’s true.
(I’m choosing not to respond to this comment for reasons relating to potential info-hazards)