Christopher King comments on [New LW Feature] “Debates”

Christopher King 1 Apr 2023 23:33 UTC
1 point
0
Question from the audience: AI safety via debate’s foundation comes from computational complexity theory. It’s a type of interactive complexity class. But can we really expect humans to understand such a complex protocol? And if not, where do the safety guarantees come from?