Buck comments on The case for ensuring that powerful AIs are controlled

Buck 14 Mar 2025 0:24 UTC
LW: 2 AF: 2
0
AF
I am not sure I agree with this change at this point. How do you feel now?
- ryan_greenblatt 14 Mar 2025 0:35 UTC
  LW: 4 AF: 4
  2
  AF Parent
  I think I disagree some with this change. Now I’d say something like “We think the control line-of-defense should mostly focus on the time before we have enough evidence to relatively clearly demonstrate the AI is consistently acting egregiously badly. However, the regime where we deploy models despite having pretty strong empirical evidence that that model is scheming (from the perspective of people like us), is not out of scope.”