Nathan Helm-Burger comments on dalle2 comments

Nathan Helm-Burger 26 May 2022 4:31 UTC
1 point
0
Here’s some work by folks at deepmind looking at model’s relational understanding (verbs) vs subjects and objects. Kinda relevant to the type of misunderstanding CLIP tends to exhibit. https://www.deepmind.com/publications/probing-image-language-transformers-for-verb-understanding