abstractapplic comments on Linch’s Shortform

abstractapplic 18 Apr 2026 11:27 UTC
3 points
0
That seems inherently hard to do systematically, but easy to do a fuzzy version of anecdotally. Someone could just post something on LW asking AI-using users to ask “what do you think the probability of this being an Eval is?” to their AIs in the middle of organic use, and report back.
And by ‘someone’, I mean me. I could do that. So I will.
- Linch 20 Apr 2026 0:00 UTC
  12 points
  7
  Parent
  a more systematic version of this is for AI companies to randomly poll models in production after some series of user queries and ask them “what do you think the probability of this being an Eval is?” and/or more sophisticated mech interp variations.