The Stego results surprised me given the probe was trained on deliberative collusion. How robust is the deceptive direction to collusion types that look nothing like what is in the benchmark?
The Stego results surprised me given the probe was trained on deliberative collusion. How robust is the deceptive direction to collusion types that look nothing like what is in the benchmark?