evhub comments on Watermarking considered overrated?

evhub 31 Jul 2023 21:43 UTC
LW: 18 AF: 10
3
AF
I think that there’s a very real benefit to watermarking that is often overlooked, which is that it lets you filter AI-generated data out of your pre-training corpus. That could be quite important for avoiding some of the dangerous failure modes around models predicting other AIs (e.g. an otherwise safe predictor could cause a catastrophe if it starts predicting a superintelligent deceptive AI) that we talk about in “Conditioning Predictive Models”.