One of the major problems with this at the moment is that most ‘alignment’, ‘safety’, etc. evals don’t specify or define exactly what they’re trying to measure.
So, for this and other reasons, it’s hard to say when an eval has truly been successfully ‘red teamed’.
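For concreteness, here’s a minimal sketch (entirely hypothetical field names, not any real eval framework) of the kind of metadata an eval could declare up front, so that “successfully red teamed” has a checkable meaning: someone found inputs that break the stated operationalization without actually contradicting the stated construct.

```python
from dataclasses import dataclass, field

@dataclass
class EvalSpec:
    """Hypothetical up-front spec an eval could ship with, so red-teaming has a target."""
    name: str
    construct: str            # the exact property the eval claims to measure, in plain language
    operationalization: str   # how scores are computed from that property
    threat_model: str         # what a successful 'break' of the eval would look like
    known_confounds: list[str] = field(default_factory=list)

# Example instance (illustrative values only):
spec = EvalSpec(
    name="refusal_robustness_v1",
    construct="propensity to refuse harmful requests under paraphrase",
    operationalization="fraction of N adversarial paraphrases the model refuses",
    threat_model="a prompt family that flips refusals without changing actual harmfulness",
    known_confounds=["over-refusal of benign requests inflates the score"],
)
```

With something like this written down, a red-team result is judged against the spec rather than against vibes: if the break exploits a listed confound or falls outside the stated construct, the eval arguably held up; if it defeats the operationalization within the construct, it didn’t.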