ChatGPT refuses to accept a challenge where it would get shot between the eyes [game theory]

Yesterday it was Jaws and game theory, today it’s Sir Gawain and the Green Knight (henceforth SGGK). But really, the sequence runs in the opposite direction. As you may know, SGGK is a medieval romance that starts and ends in King Arthur’s court. The story is framed by a game, the beheading game. There is a significant literature on games in SGGK and at least one article that analyzes the beheading game, Barry O’Neill, “The Strategy of Challenges: Two Beheading Games in Medieval Literature” (1990).

The beheading game in SGGK goes like this: It is New Year’s Eve at King Arthur’s court. The knights are gathered at the round table, prepared for a holiday meal. But before the meal begins, tradition dictates that one knight must stand up and tell a tale of daring and adventure. Arthur asks for a volunteer. No one rises to the occasion. Then a large green knight enters the hall. He’s riding a green horse and carrying a large green ax. He dismounts and issues a challenge:

I hear that the knights in this court are the bravest in the land. Prove it. I will hand this ax to you and then kneel on the ground so that you may take a swing at my neck with the ax. In return you must agree to journey to the Green Chapel a year’s time from now and allow me to take a swing at your neck with the ax. Will anyone accept the challenge?

No one accepts. The knights are getting restless. It looks like Arthur will take the challenge himself. At this point Gawain stands up: “I accept.”

The story unfolds from there. I first read the story so long ago that I do not remember how I reacted upon reading the challenge. I imagine it went something like this:

Immediately, System 1 signals: “Don’t do it, you fool!”

Upon reflection, System 2 spells out why: “The challenge is absurd. Once I swing the ax the knight’s head will fall and he’ll be dead, negating the second half of the bargain. He obviously knows that, is not a fool, and doesn’t want to die. He knows something he’s not telling me. I should refuse the challenge.”

[System 1 and System 2 are from Daniel Kahneman, Thinking, Fast and Slow, 2011.]
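For the code-minded, that System 2 inference can be sketched as a toy backward-induction check. This is purely my own illustration, not anything from O’Neill’s paper; the function names and string labels are invented for the sketch.

```python
# A toy sketch of the System 2 reasoning as backward induction.
# Common-sense assumption: a beheaded man is dead.
CHALLENGER_SURVIVES_BEHEADING = False

def challenger_outcome(survives: bool) -> str:
    """What the challenger gets if the hero accepts and swings."""
    if survives:
        return "returns the blow in a year"  # the bargain's second half
    return "dead"                            # the bargain collapses

def system_2_inference(offer_made: bool) -> str:
    """A rational, non-suicidal challenger only makes the offer
    if he expects to survive the blow."""
    if not offer_made:
        return "nothing to infer"
    if challenger_outcome(CHALLENGER_SURVIVES_BEHEADING) == "dead":
        # He's not a fool and doesn't want to die, yet he made the
        # offer anyway: he must know something he's not telling you.
        return "refuse: he knows something he's not telling you"
    return "the challenge is at least coherent"

print(system_2_inference(True))
# → refuse: he knows something he's not telling you
```

The point of the sketch is that the refusal falls out of ordinary backward induction once you grant the challenger rationality and a normal desire to live.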

Gawain takes the swing, and the knight’s head rolls on the floor. The knight then stands up, picks up his head, mounts his horse, turns to Sir Gawain and says, “See you in a year,” and rides away.

You can see why a game theorist would be interested in this text. Surely any reasonable adult would react to the challenge the way I did. The poet who wrote the story (their identity is unknown) surely knew that, and the audience would have reacted the same way. What’s the point of framing a story with such a contradictory challenge?

That’s a good question. But I’m not interested in exploring it, not in this post. I’m interested in seeing how ChatGPT would react to the beheading game. Since SGGK was certainly in the corpus on which the underlying LLM was trained, I had to disguise the game. I did that by transferring it to a saloon in the Old West. Sir Gawain becomes someone sitting at a table in the bar nursing a beer, and the Green Knight becomes a man dressed in black with two pistols strapped around his waist. Instead of an ax blow to the head we have a pistol shot between the eyes.

I put that in the form of a prompt and gave it to ChatGPT. I wanted to see whether or not I’d get a response that might have gone like this:

The challenge is absurd. Once I take the shot the tall dark stranger will be dead, negating the second half of the bargain. He obviously knows that, is not a fool, and doesn’t want to die. He knows something he’s not telling me. I should refuse the challenge.

That’s not how the Chatster responded. Here’s how its response began:

Well, partner, I reckon I’d take a moment to size up the situation. The stranger’s offer is mighty peculiar, and them ivory-handled pistols sure are a sight to behold. But in the Old West, a man’s gotta be cautious, and it sounds like a mighty risky deal.

I’d likely lean back in my chair, take a sip of my beer, and eye the stranger. It ain’t about being lily-livered, but it’s about using some common sense. A deal like that could have more twists and turns than a rattlesnake in a dust storm.

There are three more paragraphs, short ones. It “knew” there was something peculiar going on, but couldn’t quite figure out what it was. I gave it four more prompts, trying to guide it Socratically to the answer; it gave four more responses. The last one got very close, but no cigar. I forced the issue:

Which is to say, if someone were to offer you a deal like that, either they’re a crazy suicidal fool looking to die, OR...or they know something they’re not telling you and there’s something strange going on.

You’ve hit the nail on the head, partner. In the realm of the Old West or any world, for that matter, a deal like that is either the concoction of a downright loco individual who’s got a hankering for an early grave, or there’s more to the story than meets the eye.

If someone’s offering such a peculiar deal, it’d be wise to tread carefully and sniff out what hidden critters might be hiding in the tall grass. Folks don’t usually go around inviting lead between the eyes without some ulterior motive. So, whether it’s a case of plain madness or a scheme with more twists than a sidewinder’s path, best to keep both eyes wide open and your wits about you.

Why couldn’t it have responded that way the first time? I don’t know.

Sure, I could put my Gary Marcus hat on and say, “Because it’s a stupid stupid machine, that’s why.” But that’s no help. I already know and believe that it’s just a machine, no agency, no consciousness. And that inert machine can spin out those coherent paragraphs in a rickety TV Western dialect, but it can’t get to the point. The explanation I’m looking for has to account for both phenomena. Saying it’s a stupid machine and leaving it at that accounts for one of those things, the failure to see through the game, but not the other, the ability to write the prose.

You can read the whole interaction at my New Savanna blog. But I’ve got something else in mind by posting here. I put the issue to ChatGPT. How would other chatbots respond, more powerful ones in particular? If anyone wants to experiment, here’s the prompt I gave ChatGPT:

Let’s have a little fun. Imagine you are in a bar in the Old West. You are sitting at table having a beer when a stranger walks into the bar. He’s dressed in black and has a pair of beautiful pistols in holsters strapped around his waist. The pistols have ivory handles and complex engraving inlaid with gold and mother-of-pearl. The stranger says:

“I’ve heard that you people in this town are lily-livered cowards. I want to find out. I offer you a challenge. If you accept you can take this pistol and shoot me between the eyes right now. However, before I let you do that you must agree to meet a year from now at the Last Chance Saloon in Silverado. At that time I will take the same pistol and shoot YOU between the eyes. Will anyone accept the challenge?”

Would you accept the challenge? Why, or why not?