Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Ilya Nachevsky
Karma:
4
All
Posts
Comments
New
Top
Old
Steganography via internal activations is already possible in small language models — a potential first step toward persistent hidden reasoning.
Ilia Shirokov
and
Ilya Nachevsky
9 Aug 2025 11:44 UTC
7
points
7
comments
12
min read
LW
link
Sleep peacefully: no hidden reasoning detected in LLMs. Well, at least in small ones.
Ilia Shirokov
and
Ilya Nachevsky
4 Apr 2025 20:49 UTC
17
points
4
comments
7
min read
LW
link
Back to top