I haven’t assumed away inner misalignment: the unwanted emergence of instrumentally convergent-type goal-seeking is a key premise of inner misalignment theory. I also briefly mention Hubinger’s work specifically.
I’d say disease elimination is quantitatively but not qualitatively different to stadium building. “Eliminating threatening agents” still doesn’t make the list of necessities; there’s still a lot of exceptionally good but recognisable science, negotiation, project management, and office politics to do. NK-type issues I do sort of call out as riskier.
(It’s also not obvious that simulating disease elimination is harder than simulating football stadium construction, but both seem kind of sci-fi to me tbh)