Matthew Bozoukov
Karma: 34
Transmitting Misalignment with Subliminal Learning via Paraphrasing
Matthew Bozoukov, Taywon Min, CallumMcDougall and J Rosser
17 Dec 2025 19:34 UTC, 38 points, 0 comments, 10 min read
Manipulating Self-Preference In LLMs
Matthew Nguyen, Jou Barzdukas, Matthew Bozoukov and Hongyu Fu
1 Jul 2025 18:03 UTC, 13 points, 0 comments, 7 min read