I agree with this sentiment (“having lots of data is useful for deconfusion”) and think this is probably the most promising avenue for alignment research. In particular, I think we should prioritize the kinds of research that give us lots of bits about things that could matter. Though from my perspective actually most empirical alignment work basically fails this check, so this isn’t just a “empirical good” take.
I agree with this sentiment (“having lots of data is useful for deconfusion”) and think this is probably the most promising avenue for alignment research. In particular, I think we should prioritize the kinds of research that give us lots of bits about things that could matter. Though from my perspective actually most empirical alignment work basically fails this check, so this isn’t just a “empirical good” take.
Reacted with “examples”, but curious about examples/papers/etc both of things you think give lots of bits and things that don’t.