Artur Zolkowski
Karma: 36
Can Reasoning Models Obfuscate Reasoning? Stress-Testing Chain-of-Thought Monitorability
Artur Zolkowski and Wen Xing
24 Oct 2025 17:21 UTC, 18 points, 1 comment, 5 min read
Early Signs of Steganographic Capabilities in Frontier LLMs
Kei Nishimura-Gasparian, Artur Zolkowski, robert mccarthy and David Lindner
4 Jul 2025 16:36 UTC, 33 points, 5 comments, 2 min read