RSS

Jaehyuk Lim

Karma: 37

Inner-aligning AI and trying to ask better questions.

Jailbreak­ing ChatGPT and Claude us­ing Web API Con­text Injection

Jaehyuk Lim21 Oct 2024 21:34 UTC
4 points
0 comments3 min readLW link

HDBSCAN is Sur­pris­ingly Effec­tive at Find­ing In­ter­pretable Clusters of the SAE De­coder Matrix

11 Oct 2024 23:06 UTC
8 points
2 comments10 min readLW link

Bi­as­ing VLM Re­sponse with Vi­sual Stimuli

Jaehyuk Lim3 Oct 2024 18:04 UTC
5 points
0 comments8 min readLW link

[Question] SAE sparse fea­ture graph us­ing only resi­d­ual layers

Jaehyuk Lim23 May 2024 13:32 UTC
0 points
3 comments1 min readLW link

Iden­ti­fy­ing Micro-fric­tion in the Con­text of the An­te­rior Mid-Cin­gu­late Cor­tex (aMCC)

Jaehyuk Lim29 Mar 2024 22:11 UTC
3 points
0 comments3 min readLW link

Lan­guage Models Don’t Learn the Phys­i­cal Man­i­fes­ta­tion of Language

22 Feb 2024 18:52 UTC
39 points
23 comments1 min readLW link
(arxiv.org)