Near the beginning, Daniel is basically asking Jan how they plan on aligning the automated alignment researcher, and if they can do that, then it seems that there wouldn’t be much left for the AAR to do.
Jan doesn’t seem to comprehend the question, which is not an encouraging sign.
I think I probably didn’t quite word that question right, and that’s what’s explaining the confusion—I meant something like “Once you’ve created the AAR, what alignment problems are left to be solved? Please answer in terms of the gap between the AAR and superintelligence.”
Near the beginning, Daniel is basically asking Jan how they plan on aligning the automated alignment researcher, and if they can do that, then it seems that there wouldn’t be much left for the AAR to do.
Jan doesn’t seem to comprehend the question, which is not an encouraging sign.
I think I probably didn’t quite word that question right, and that’s what’s explaining the confusion—I meant something like “Once you’ve created the AAR, what alignment problems are left to be solved? Please answer in terms of the gap between the AAR and superintelligence.”