Would be interesting to see how this experiment changes if the games are played iteratively, that is, if players can get a sense of who they are playing with, how they lie and deceive and what their tells are. I suspect that humans would outperform in this respect because of our better memory.
Agreed that likely humans would outperform more! At the moment we don’t have a human baseline for AmongUs vs. language models yet, so we wouldn’t be able to tell if it improved, but it’s a good follow-up.
Would be interesting to see how this experiment changes if the games are played iteratively, that is, if players can get a sense of who they are playing with, how they lie and deceive and what their tells are. I suspect that humans would outperform in this respect because of our better memory.
Agreed that likely humans would outperform more! At the moment we don’t have a human baseline for AmongUs vs. language models yet, so we wouldn’t be able to tell if it improved, but it’s a good follow-up.