No, more like a disjunction of possibilities along the lines of:
The critical AGIs come before huge numbers of von Neumann level AGIs.
At that level, really basic stuff like “just look at the chain of thought” turns out to still work well enough, so scheming isn’t a hard enough problem to be a bottleneck.
Scheming turns out to not happen by default in a bunch of von Neumann level AGIs, or is at least not successful at equilibrium (e.g. because the AIs don’t fully cooperate with each other).
“huge numbers of von Neumann level AGIs” and/or “scheming” turns out to be the wrong thing to picture in the first place; the future is Weirder than that, in ways which make our intuitions about von Neumann society and/or scheming not transfer at all.
Pile together the probability mass on those sorts of things, and it seems far more probable than the prototypical scheming story.
In your post, you emphasize the slop problem. I think that the slop problem is probably much harder to solve if those AIs are scheming. I guess you’re saying that it’s just unlikely that the AIs are scheming at the point where you’re worried about the slop problem?
Yeah, basically. Or at least unlikely that they’re scheming enough or competently enough for it to be the main issue.
For instance, consider today’s AIs. If we keep getting slop at roughly the current level, and scheming at roughly the current level, then slop is going to be the far bigger barrier to using these things to align superintelligence (or nearer-but-strong intelligence).