The best simple argument for Pausing AI?

Not saying we should pause AI, but consider the following argument:

  1. Alignment without the capacity to follow rules is hopeless. You can’t possibly follow laws like Asimov’s Laws (or better alternatives to them) if you can’t reliably learn to abide by simple constraints like the rules of chess.

  2. LLMs can’t reliably follow rules. As discussed in Marcus on AI yesterday, drawing on data from Mathieu Acher, even reasoning models like o3 empirically struggle with the rules of chess, and they do so even though they can explicitly explain those rules (see the same article). Such compliance can be checked mechanically, as the sketch after this list illustrates. The Apple “thinking” paper, which I have discussed extensively in three recent articles on my Substack, gives another example: an LLM can’t reliably solve Tower of Hanoi with 9 discs. (This is not a token-related artifact.) In the last month, four other papers have shown related failures to comply with moderately complex rules.

  3. If a system can’t follow rules such as Asimov’s Laws, it has no business running the world’s military, infrastructure, etc. Perhaps we should pause the widespread rollout of Generative AI in safety-critical domains unless and until it can be relied on to follow rules with significantly greater reliability.
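For concreteness, here is a minimal sketch (in Python) of the kind of rule-compliance harness point 2 alludes to. It is a sketch under assumptions, not anyone’s actual test code: `ask_model` is a hypothetical stub for whatever LLM API is under test, and the python-chess library does the legality checking; the Hanoi validator applies the same idea to the Apple paper’s task.

```python
# A minimal sketch of a rule-compliance harness. ask_model() is a
# hypothetical stub for the LLM being tested. The point is that legality
# is mechanically checkable, so "follows the rules" is a measurable
# property, not a matter of opinion.

import chess  # pip install python-chess

def ask_model(fen: str) -> str:
    """Hypothetical stub: return the model's proposed move (in SAN) for a position."""
    raise NotImplementedError("wire this to the LLM under test")

def chess_legality_rate(positions: list[str]) -> float:
    """Fraction of model-proposed moves that are legal in the given FEN positions."""
    legal = 0
    for fen in positions:
        board = chess.Board(fen)
        try:
            board.parse_san(ask_model(fen))  # raises ValueError if illegal or unparseable
            legal += 1
        except ValueError:
            pass  # illegal move, ambiguous notation, or malformed output
    return legal / len(positions)

def hanoi_trace_is_valid(moves: list[tuple[int, int]], n_discs: int) -> bool:
    """Check a proposed Tower of Hanoi solution for n_discs discs on pegs 0-2.

    Rules enforced: never move from an empty peg, never place a larger disc
    on a smaller one, and the trace must end with every disc on peg 2.
    """
    pegs = [list(range(n_discs, 0, -1)), [], []]  # peg 0 starts with discs n..1
    for src, dst in moves:
        if not pegs[src]:
            return False  # moved from an empty peg
        if pegs[dst] and pegs[dst][-1] < pegs[src][-1]:
            return False  # larger disc placed on a smaller one
        pegs[dst].append(pegs[src].pop())
    return pegs[2] == list(range(n_discs, 0, -1))
```

With a harness like this, “can follow the rules of chess” becomes a number one can track across models and task sizes, which is the kind of evidence the Acher data and the Apple paper report.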

Corollary: hyping LLMs is bad, because it increases the global deployment of systems that lack a basic prerequisite for alignment, namely the ability to comply with rules. This in turn elevates the risk of AI causing catastrophic harm to humanity.

Non-ad-hominem comments are welcome.