ZY comments on Zach Stein-Perlman’s Shortform

ZY 26 Sep 2025 18:28 UTC
1 point
0
The basic approach is: do evals; find weaker capabilities than other open-weights models; infer that it’s safe to release weights.
Curious: what made you think this is new to Code World Model compared to other Meta releases?
- Zach Stein-Perlman 26 Sep 2025 18:32 UTC
  2 points
  0
  I don’t think it’s very new. IIRC it’s suggested in Meta’s safety framework. But past evals stuff (see the first three bullets above) has been more like “the model doesn’t have dangerous capabilities” than “the model is weaker than these specific other models,” maybe in part because previous releases have been closer to SOTA. I don’t recall past releases being justified as “safe because weaker than other models.”