Oliver Daniels comments on
Test your interpretability techniques by de-censoring Chinese models
Oliver Daniels
16 Jan 2026 2:34 UTC
1
point
0
nice, looks promising!