Nikola Jurkovic comments on "Opus 4.6 Reasoning Doesn’t Verbalize Alignment Faking, but Behavior Persists"
Nikola Jurkovic
10 Mar 2026 20:49 UTC
2 points
I would be really interested to see the results of other companies’ models on this!