Cleo Nardo comments on Shortform

Cleo Nardo 1 Feb 2025 15:47 UTC
3 points
0
I don’t think this works when the AIs are smart and reasoning in-context, which is the case where scheming matters. Also this maybe backfires by making scheming more salient.
Still, might be worth running an experiment.