Apparent moral: Apparent corrigibility from training or surface behavior doesn’t fix inner goals; without hard guarantees and control, you lose.
Actual moral: First to superintelligence wins; second place is ash.
First to superintelligence wins
This phrasing seems ambiguous between the claims “the first agent to BE superintelligent wins” and “the first agent to CREATE something superintelligent wins”.
This distinction might be pretty important to your strategy.