Fabien Roger comments on Rogue internal deployments via external APIs

Fabien Roger 16 Oct 2025 12:27 UTC
4 points
−1
(I had elements on persuasion which I think roughly covered what you are describing here. I also had elements on poisoning the data of future models which covers the “triggering emergent misalignment of AIs it trains” you describe above.)