J Bostock comments on Brendan Long’s Shortform

J Bostock 9 May 2026 12:16 UTC
6 points
0
Perhaps the model is updating its prior on “I am in an alignment eval” relative to “I am in a ridiculous roleplay scenario”.