StanislavKrym comments on It Is Untenable That Near-Future AI Scenario Models Like “AI 2027” Don’t Include Open Source AI

StanislavKrym 16 May 2025 3:24 UTC
1 point
0
The problem with non-open-weight models is that they need to be exfiltrated before wrecking havoc, while open-weight models cannot avoid being evaluated. Suppose that the USG decides that all open-weight models are to be tested by OpenBrain for being aligned or misaligned. Then even a misaligned Agent-x has no reason to blow its cover by failing to report an open-weight rival.