Going through your beliefs -
The EU AI Act’s Code of Practice (Safety and Security chapter) mandates external evaluations for systemic risks. That’s definitely a start, so regulations are getting there.
I think the way the Act is set up, labs do their own testing and external orgs add an extra perspective, so it’s not just labs high-fiving themselves.
Isn’t some overlap in personnel to be expected, given that the AI safety field is small?
Also, I fail to see how evals take away from passing new regulations. Evals, like other work in this field, are building tech that will only be adopted / impactful when complemented by governance / regulations or some other incentives.