RSS

Jorio Cocola

Karma: 307

MATS 8.0 Scholar

Weird Gen­er­al­iza­tion & In­duc­tive Backdoors

11 Dec 2025 18:18 UTC
152 points
8 comments8 min readLW link

OpenAI fine­tun­ing met­rics: What is go­ing on with the loss curves?

24 Nov 2025 18:29 UTC
41 points
5 comments2 min readLW link

Con­cept Poi­son­ing: Prob­ing LLMs with­out probes

5 Aug 2025 17:00 UTC
60 points
5 comments13 min readLW link

Selec­tive Gen­er­al­iza­tion: Im­prov­ing Ca­pa­bil­ities While Main­tain­ing Alignment

16 Jul 2025 21:25 UTC
71 points
6 comments7 min readLW link