Gordon Seidoh Worley comments on New paper: (When) is Truth-telling Favored in AI debate?

Gordon Seidoh Worley 4 Jan 2020 0:02 UTC
LW: 3 AF: 2
0
AF
If it’s interesting to you, I’d be happy to talk about my ideas around AI safety via dialectic, an approach that can be made to look like debate and generally fits the IDA paradigm, and encourage you to run with the idea if you like. I wrote vaguely about the idea a while back, and think it could be interesting to pursue, but am not actively working on it because I don’t think it has the highest comparative leverage for me.