Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Andrew Mack
Karma:
328
All
Posts
Comments
New
Top
Old
Deep Causal Transcoding: A Framework for Mechanistically Eliciting Latent Behaviors in Language Models
Andrew Mack
and
TurnTrout
3 Dec 2024 21:19 UTC
106
points
8
comments
41
min read
LW
link
Mechanistically Eliciting Latent Behaviors in Language Models
Andrew Mack
and
TurnTrout
30 Apr 2024 18:51 UTC
221
points
43
comments
45
min read
LW
link
Back to top