
James Chua

Karma: 632

https://jameschua.net/about/

Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers

18 Dec 2025 20:21 UTC
141 points
11 comments · 8 min read · LW link
(arxiv.org)

OpenAI finetuning metrics: What is going on with the loss curves?

24 Nov 2025 18:29 UTC
41 points
5 comments · 2 min read · LW link

Backdoor awareness and misaligned personas in reasoning models

20 Jun 2025 23:38 UTC
34 points
8 comments · 6 min read · LW link

Thought Crime: Backdoors & Emergent Misalignment in Reasoning Models

16 Jun 2025 16:43 UTC
68 points
2 comments · 8 min read · LW link

OpenAI Responses API changes models’ behavior

11 Apr 2025 13:27 UTC
53 points
6 comments · 2 min read · LW link