The primary application of “safety research” is improving refusal calibration, which, at least from a retail client’s perspective, is exactly like a capability improvement: it makes no difference to me whether the model can’t satisfy my request or can but won’t. It’s easy to demonstrate differences in this regard – simply show one model refusing a request another fulfills – so I disagree that this would cause clients to be “dissuaded from AI in general.”
I disagree that the primary application of safety research is improving refusal calibration. This take seems outdated by ~12 months.