I think you’re moving the goalposts, since earlier you specified “without external calculators”. I think external tools are likely to be critical here, and I’m much more optimistic about that path to this kind of robust generalization. That doesn’t necessarily address concerns about how the system reasons internally, though, which still seems likely to be critical for alignment.