RSS

Daan Henselmans

Karma: 31

Computational linguist, writer, AI dev. Currently running AI safety research.

Low-Tem­per­a­ture Eval­u­a­tions Can Mask Crit­i­cal AI Behaviors

13 Nov 2025 20:12 UTC
7 points
0 comments4 min readLW link

Thin Align­ment Can’t Solve Thick Problems

Daan Henselmans27 Apr 2025 22:42 UTC
11 points
2 comments9 min readLW link

Align­ment Can Re­duce Perfor­mance on Sim­ple Eth­i­cal Questions

Daan Henselmans3 Feb 2025 19:35 UTC
16 points
7 comments6 min readLW link