Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Jaehyuk Lim
Karma:
37
Inner-aligning AI and trying to ask better questions.
All
Posts
Comments
New
Top
Old
Jailbreaking ChatGPT and Claude using Web API Context Injection
Jaehyuk Lim
21 Oct 2024 21:34 UTC
4
points
0
comments
3
min read
LW
link
HDBSCAN is Surprisingly Effective at Finding Interpretable Clusters of the SAE Decoder Matrix
Jaehyuk Lim
,
Kanishk Tantia
and
Sinem
11 Oct 2024 23:06 UTC
8
points
2
comments
10
min read
LW
link
Biasing VLM Response with Visual Stimuli
Jaehyuk Lim
3 Oct 2024 18:04 UTC
5
points
0
comments
8
min read
LW
link
[Question]
SAE sparse feature graph using only residual layers
Jaehyuk Lim
23 May 2024 13:32 UTC
0
points
3
comments
1
min read
LW
link
Identifying Micro-friction in the Context of the Anterior Mid-Cingulate Cortex (aMCC)
Jaehyuk Lim
29 Mar 2024 22:11 UTC
3
points
0
comments
3
min read
LW
link
Language Models Don’t Learn the Physical Manifestation of Language
Bruce W. Lee
and
Jaehyuk Lim
22 Feb 2024 18:52 UTC
39
points
23
comments
1
min read
LW
link
(arxiv.org)
Back to top