Usman Anwar
Karma: 47
Paraphrasing Is (At Best) a Partial Defence Against Steganography in LLMs
Usman Anwar and robertzk
3 May 2026 7:53 UTC
7 points, 0 comments, 8 min read
A Sober Look at Steering Vectors for LLMs
Joschka Braun, Dmitrii Krasheninnikov, Usman Anwar, RobertKirk, Daniel Tan and David Scott Krueger
23 Nov 2024 17:30 UTC
42 points, 0 comments, 5 min read