Ilya Nachevsky

Karma: 4

Steganography via internal activations is already possible in small language models — a potential first step toward persistent hidden reasoning.

Ilia Shirokov and Ilya Nachevsky

9 Aug 2025 11:44 UTC

7 points

7 comments12 min readLW link

Sleep peacefully: no hidden reasoning detected in LLMs. Well, at least in small ones.

Ilia Shirokov and Ilya Nachevsky

4 Apr 2025 20:49 UTC

17 points

4 comments7 min readLW link