
James Sullivan

Karma: 58

I’m a software engineer who is interested in AI, futurism, space, and the big questions of life.

https://www.linkedin.com/in/jamessullivan092/

What Reasoning Steps Cause Alignment Faking?

James Sullivan, 28 Apr 2026 4:37 UTC
3 points
0 comments, 9 min read, LW link
(open.substack.com)

Are we aligning the model or just its mask?

James Sullivan, 27 Mar 2026 2:10 UTC
12 points
0 comments, 10 min read, LW link
(substack.com)

Playing Dumb: Detecting Sandbagging in Frontier LLMs via Consistency Checks

James Sullivan, 13 Jan 2026 19:28 UTC
11 points
0 comments, 5 min read, LW link

Jailbreaking Claude 4 and Other Frontier Language Models

James Sullivan, 15 Jun 2025 0:31 UTC
1 point
0 comments, 3 min read, LW link
(open.substack.com)

How do AI agents work together when they can’t trust each other?

James Sullivan, 6 Jun 2025 3:10 UTC
17 points
0 comments, 8 min read, LW link
(jamessullivan092.substack.com)

Developmental Stages in Multi-Problem Grokking

James Sullivan, 29 Sep 2024 18:58 UTC
4 points
0 comments, 6 min read, LW link