delton137 comments on Visible Thoughts Project and Bounty Announcement

delton137 30 Nov 2021 15:15 UTC
5 points
I don’t have much direct experience with transformers. (I was once part of some research with BERT where we found it was really hard to use without adding hard-coded rules on top, but I have no experience with the modern GPT stuff.) However, what you are saying makes a lot of sense to me based on my experience with CNNs and the attempts I’ve seen to explain or justify CNN behaviour with side channels (for instance, this medical image classification system that also generates text as a side output).

See also my comment on Facebook.