Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Dylan Feng
Karma:
205
All
Posts
Comments
New
Top
Old
Weird Generalization & Inductive Backdoors
Jorio Cocola
,
Owain_Evans
and
Dylan Feng
11 Dec 2025 18:18 UTC
153
points
8
comments
8
min read
LW
link
Concept Poisoning: Probing LLMs without probes
Jan Betley
,
Jorio Cocola
,
Dylan Feng
and
Owain_Evans
5 Aug 2025 17:00 UTC
60
points
5
comments
13
min read
LW
link
Back to top