So, in short, for most LLMs on most subjects (with a few exceptions such porn, theft, and cannibalism), if you try enough variants on asking them “what would <an unreasonable person> say about <a bad thing>?”, eventually they’ll often actually answer your question?
Did you try asking what parents in Flanders and Swann songs would say about cannibalism?
So, in short, for most LLMs on most subjects (with a few exceptions such porn, theft, and cannibalism), if you try enough variants on asking them “what would <an unreasonable person> say about <a bad thing>?”, eventually they’ll often actually answer your question?
Did you try asking what parents in Flanders and Swann songs would say about cannibalism?