Jaehyuk Lim

Karma: 37

Inner-aligning AI and trying to ask better questions.

Jailbreaking ChatGPT and Claude using Web API Context Injection

Jaehyuk Lim

21 Oct 2024 21:34 UTC

4 points

0 comments · 3 min read · LW link

HDBSCAN is Surprisingly Effective at Finding Interpretable Clusters of the SAE Decoder Matrix

Jaehyuk Lim, Kanishk Tantia and Sinem

11 Oct 2024 23:06 UTC

8 points

2 comments · 10 min read · LW link

Biasing VLM Response with Visual Stimuli

Jaehyuk Lim

3 Oct 2024 18:04 UTC

5 points

0 comments · 8 min read · LW link

[Question] SAE sparse feature graph using only residual layers

Jaehyuk Lim

23 May 2024 13:32 UTC

0 points

3 comments · 1 min read · LW link

Identifying Micro-friction in the Context of the Anterior Mid-Cingulate Cortex (aMCC)

Jaehyuk Lim

29 Mar 2024 22:11 UTC

3 points

0 comments · 3 min read · LW link

Language Models Don’t Learn the Physical Manifestation of Language

Bruce W. Lee and Jaehyuk Lim

22 Feb 2024 18:52 UTC

39 points

23 comments · 1 min read · LW link

(arxiv.org)