That’s why it’s called… *scalable* alignment? :D

Somewhat tongue-in-cheek, but I think I am genuinely confused about what the core news here is.
“Scalable” in the sense of “scalable oversight” refers to designing methods for scaling human-level oversight to systems beyond human-level capabilities. This doesn’t necessarily mean the same thing as designing methods for reliably converting compute into more alignment. In practice, techniques for scalable oversight like debate, IDA, etc. often have the property that they scale with compute, but this is left as an implicit desideratum rather than an explicit constraint.