Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Wen Xing
Karma:
31
All
Posts
Comments
New
Top
Old
Can Reasoning Models Obfuscate Reasoning? Stress-Testing Chain-of-Thought Monitorability
Artur Zolkowski
and
Wen Xing
24 Oct 2025 17:21 UTC
18
points
1
comment
5
min read
LW
link
Vulnerability in Trusted Monitoring and Mitigations
Wen Xing
and
Perusha Moodley
7 Jun 2025 7:16 UTC
17
points
1
comment
7
min read
LW
link
Back to top