Provided people keep playing this game, this will eventually happen anyway. And if, in that eventually released log of an AI victory, the gatekeeper is persuaded by less compelling strategies than yours, it will be even easier for readers to believe "it couldn't happen to me".
Secondly, since we’re assuming Oracle AI is possible and boxing seems to be most people’s default strategy for when that happens, there will be future gatekeepers facing actual AIs. Shouldn’t you try to immunize them against at least some of the strategies AIs could conceivably discover independently?
The number of people actually playing this game is quite small, and the number of winning AIs is even smaller (to the point where Tuxedage can charge $750 a round and isn’t immediately flooded with competitors). And secrecy is considered part of the game’s standard rules. So it is not obvious that AI win logs will eventually be released anyway.
> The number of people actually playing this game is quite small, and the number of winning AIs is even smaller (to the point where Tuxedage can charge $750 a round and isn't immediately flooded with competitors).
A round seems to require not just the 2 hours of chat but also many hours of background research. If we assume 8 hours of background research and script writing on top of those 2 hours, the $750 fee works out to $75/hour, and I think most people with advanced persuasion skills can command a higher hourly rate than that.
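For concreteness, a minimal sketch of that back-of-the-envelope calculation; the $750 fee and the 2-hour session come from the thread, while the prep-hour figures other than the 8 assumed above are illustrative:

```python
# Hourly rate implied by Tuxedage's per-round fee.
# The $750 fee and the 2-hour chat session are from the thread;
# the prep-hour figures are assumptions for illustration.
fee = 750        # dollars per round
chat_hours = 2   # length of the chat session itself

for prep_hours in (4, 8, 16):   # assumed research + script-writing time
    rate = fee / (chat_hours + prep_hours)
    print(f"{prep_hours} prep hours -> ${rate:.0f}/hour")
```

At 8 prep hours this reproduces the $75/hour figure; note how sensitive the rate is to how much preparation a round actually takes.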
> Shouldn't you try to immunize them against at least some of the strategies AIs could conceivably discover independently?
I don't think reading a few logs would immunize someone. If you wanted to immunize someone, I would suggest a few years of therapy with a good psychologist to work through any traumas in that person's life, as well as the existential questions.
I would add many hours of meditation, to learn to have control over your own mind.
You could train someone to precommit and build emotional endurance. If someone is left alone in a room with highly addictive drugs for a few hours and has enough control over their own mind to refuse them, I would trust them more to stay emotionally stable in front of an AI.
You could also require gatekeepers to have played the AI role in the experiment a few times.
You might also look into the techniques the military uses to train soldiers to resist torture.
But even with all these safety measures, it's still dangerous.