Capybasilisk comments on AXRP Episode 24 - Superalignment with Jan Leike

Capybasilisk 29 Jul 2023 22:03 UTC
LW: 1 AF: 1
0
AF
Near the beginning, Daniel is basically asking Jan how they plan on aligning the automated alignment researcher, and if they can do that, then it seems that there wouldn’t be much left for the AAR to do.

Jan doesn’t seem to comprehend the question, which is not an encouraging sign.
- DanielFilan 31 Jul 2023 16:30 UTC
  LW: 3 AF: 3
  0
  AF Parent
  I think I probably didn’t quite word that question right, and that’s what’s explaining the confusion—I meant something like “Once you’ve created the AAR, what alignment problems are left to be solved? Please answer in terms of the gap between the AAR and superintelligence.”