Cleo Nardo comments on Jeremy Kalfus’s Shortform

Cleo Nardo 26 Apr 2026 23:13 UTC
4 points
1
You could say “reduces one component of the attack surface” or “closes off threat model X”. But “reduces risk to near-zero” is a tell that you aren’t using the right mindset.
- Jeremy Kalfus 27 Apr 2026 2:21 UTC
  1 point
  0
  I agree it is a strong statement, but I genuinely cannot think of any other way a model could self-exfiltrate its weights, given that the actual numbers don’t exist anywhere digitally once the model is run.
  
  I have no particular expertise in this area, so take what I say with a grain of salt, but my lack of expertise does not by itself make what I have said wrong.