Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
megasilverfist comments on
Profanity causes emergent misalignment, but with qualitatively different results than insecure code
megasilverfist
7 Feb 2026 12:42 UTC
1
point
0
Me and David are doing some followup work on EM, but mostly didn’t follow this branch.
Back to top
Me and David are doing some followup work on EM, but mostly didn’t follow this branch.