Jiachen Zhao

Karma: 25

LLMs Encode Harmfulness and Refusal Separately

Jiachen Zhao · 22 Jul 2025 18:53 UTC
24 points
4 comments · 8 min read · LW link
(www.arxiv.org)