RSS

Derck Prinzhorn

Karma: 6

Minor Word­ing Changes Pro­duce Ma­jor Shifts in AI Behavior

26 Nov 2025 12:52 UTC
2 points
0 comments6 min readLW link

Low-Tem­per­a­ture Eval­u­a­tions Can Mask Crit­i­cal AI Behaviors

13 Nov 2025 20:12 UTC
8 points
1 comment4 min readLW link