One major counterargument here is “is control a necessary piece of the ‘solve alignment in time plan’”? Like, it may be “5-10x less important” than dealing with slop, but, still, in if you didn’t eventually solve both you don’t get useful carefully-implemented-slightly-superhuman work done, and it (might be) that our surviving worlds look like either that, or “get a serious longterm pause.”
One major counterargument here is “is control a necessary piece of the ‘solve alignment in time plan’”? Like, it may be “5-10x less important” than dealing with slop, but, still, in if you didn’t eventually solve both you don’t get useful carefully-implemented-slightly-superhuman work done, and it (might be) that our surviving worlds look like either that, or “get a serious longterm pause.”