I’d believe the claim if I thought that alignment was easy enough that AI products that pass internal product review and which don’t immediately trigger lawsuits would be aligned enough to not end the world through alignment failure. But I don’t think that’s the case, unfortunately.
It seems like we’ll have to put special effort into both single/single alignment and multi/single “alignment”, because the free market might not give it to us.
I’d believe the claim if I thought that alignment was easy enough that AI products that pass internal product review and which don’t immediately trigger lawsuits would be aligned enough to not end the world through alignment failure. But I don’t think that’s the case, unfortunately.
It seems like we’ll have to put special effort into both single/single alignment and multi/single “alignment”, because the free market might not give it to us.