Zac Hatfield-Dodds answers Is anyone working on formally verified AI toolchains?

Zac Hatfield-Dodds 13 Mar 2024 1:01 UTC
12 points
2
Safety properties aren’t the kind of properties you can prove; they’re statements about the world, not about mathematical objects. I very strongly encourage anyone reading this comment to go read Leveson’s Engineering a Safer World (free pdf from author) through to the end of chapter three—it’s the best introduction to systems safety that I know of and a standard reference for anyone working with life-critical systems. how.complexsystems.fail is the short-and-quotable catechism.

I’m not really sure what you mean by “AI toolchain”, nor what threat model would have a race-condition present an existential risk. More generally, formal verification is a research topic—there’s some neat demonstration systems and they’re used in certain niches with relatively small amounts of code and compute, simple hardware, and where high development times are acceptable. None of those are true of AI systems, or even libraries such as Pytorch.

For flavor, some of the most exciting developments in formal methods: I expect the Lean FRO to improve usability, and ‘autoformalization’ tricks like Proofster (pdf) might also help—but it’s still niche, and “proven correct” software can still have bugs from under-specified components, incorrect axioms, or outright hardware issues (e.g. Spectre, Rowhammer, cosmic rays, etc.). The seL4 microkernel is great, but you still have to supply an operating system and application layer, and then ensure the composition is still safe. To test an entire application stack, I’d instead turn to Antithesis, which is amazing so long as you can run everything in an x86 hypervisor (with no GPUs).

(as always, opinions my own)