Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
evhub comments on
Red-teaming language models via activation engineering
evhub
26 Aug 2023 22:04 UTC
LW: 3 AF: 3
0
AF
(Moderation note: added to the Alignment Forum from LessWrong.)
Back to top
(Moderation note: added to the Alignment Forum from LessWrong.)