RSS

Cam

Karma: 186

I help run www.geodesicresearch.org

Ar­chi­tec­tures for In­creased Ex­ter­nal­i­sa­tion of Reasoning

26 Nov 2025 20:24 UTC
17 points
1 comment13 min readLW link

Gen­er­al­i­sa­tion Hack­ing: a first look at ad­ver­sar­ial gen­er­al­i­sa­tion failures in de­liber­a­tive alignment

17 Nov 2025 21:44 UTC
30 points
0 comments8 min readLW link

Open-weight train­ing prac­tices and im­pli­ca­tions for CoT monitorability

4 Nov 2025 10:49 UTC
15 points
0 comments9 min readLW link

Cam’s Shortform

Cam9 Feb 2025 17:32 UTC
1 point
9 comments1 min readLW link