TsviBT comments on Vladimir_Nesov’s Shortform

TsviBT 4 Dec 2025 5:16 UTC
2 points
Ok. If I’m following, I think I agree, except that I’d probably say “you mostly need to solve [what I’m calling the hard parts] in order to solve intermediate alignment well enough for pre-strong AIs to be the engines of alignment progress”. So either I’m wrong about what the hard parts are, or you actually need to solve the hard parts to get scalable oversight (and therefore scalable oversight doesn’t really help much).