Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Pranav Gade
Karma:
143
All
Posts
Comments
New
Top
Old
Apply to the AI Security Bootcamp [Aug 4 - Aug 29]
Pranav Gade
,
Jan Michelfeit
and
Jinglin Li
1 Jun 2025 19:47 UTC
27
points
1
comment
4
min read
LW
link
unRLHF—Efficiently undoing LLM safeguards
Pranav Gade
,
Jeffrey Ladish
and
Simon Lermen
12 Oct 2023 19:58 UTC
117
points
15
comments
20
min read
LW
link
Back to top