RSS

Derck Prinzhorn

Karma: 5

Minor Word­ing Changes Pro­duce Ma­jor Shifts in AI Behavior

26 Nov 2025 12:52 UTC
2 points
0 comments6 min readLW link

Low-Tem­per­a­ture Eval­u­a­tions Can Mask Crit­i­cal AI Behaviors

13 Nov 2025 20:12 UTC
7 points
0 comments4 min readLW link