RSS

Ilia Shirokov

Karma: 21

Steganog­ra­phy via in­ter­nal ac­ti­va­tions is already pos­si­ble in small lan­guage mod­els — a po­ten­tial first step to­ward per­sis­tent hid­den rea­son­ing.

9 Aug 2025 11:44 UTC
7 points
0 comments12 min readLW link

Sleep peace­fully: no hid­den rea­son­ing de­tected in LLMs. Well, at least in small ones.

4 Apr 2025 20:49 UTC
17 points
2 comments7 min readLW link