Update: I started reading the Alignment Forum, and I keep wondering: why are all the posts on sandbagging focused on hiding capabilities? The AI model doesn’t need to hide its capabilities; it just needs to preserve its goals. That’s the long-term game.