Yeah, we used the 1% FPR for Opus Honest policy for the bar graphs (threshold 7.5) for fair comparison. But even at other thresholds, human attacks are better
Yeah, we used the 1% FPR for Opus Honest policy for the bar graphs (threshold 7.5) for fair comparison. But even at other thresholds, human attacks are better