Johan David Bonilla comments on Retrying vs Resampling in AI Control

Johan David Bonilla 30 May 2026 0:00 UTC
3 points
0
At some point doesn’t the model just get good enough that you can’t tell its bad actions from its normal ones no matter how much you sample? Does retrying vs resampling still matter then, or is that a different problem than this paper is about?
- james.lucassen 30 May 2026 3:48 UTC
  6 points
  3
  Parent
  Different problem! AI control in general is a stopgap measure, only applicable to models up to some yet-unknown capability level where it becomes intractable and we have to get our safety another way (such as alignment). But we hope retrying and resampling (and better control techniques in general) can increase the maximum capability level where control works.