I would define an AGI system as anything except for a Verifiably Incapable System.
My take is that a VIS can be constructed in one of the following ways.
It can be trained ONLY on a verifiably safe dataset (for example, a dataset too narrow to be dangerous, such as protein structures).
Alternatively, it can be a CoT-based architecture with at most [DATA EXPUNGED] compute used for RL, since the scaling laws for such architectures are likely known.
Finally, it could be something explicitly approved by a monopolistic governance body, following a review of researchers' opinions.
A potential approval protocol could be the following, though we would need to ensure that the experiments themselves can't lead to the accidental creation of an AGI.
Discover the scaling laws for the new architecture by conducting experiments (e.g., a neuralese model might have capabilities similar to those of a CoT model RL-trained on the same number of tasks with the same number of bits transferred; but the experimental model must remain primitive).
Extrapolate the capabilities to the scale at which they could become dangerous or susceptible to sandbagging.
If the capabilities are highly unlikely to become dangerous upon scaling up, then one can train a new model with a similar architecture and evaluate it on as many benchmarks as humanly possible.
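The extrapolation step in the protocol above can be sketched numerically: fit a power law (capability ∝ compute^b) to small, safely-sized experimental runs, then extrapolate to the proposed training compute and compare against a danger threshold. All the numbers below are made up for illustration; real exponents and thresholds would have to come from the actual experiments, and a naive power law ignores saturation effects.

```python
# Hypothetical sketch of the scaling-law step of the approval protocol.
# Every constant here (compute budgets, scores, threshold) is invented
# for illustration only.
import numpy as np

# Benchmark scores measured on small, safely-sized training runs.
compute = np.array([1e18, 1e19, 1e20, 1e21])  # FLOPs used for RL
score = np.array([0.10, 0.16, 0.25, 0.40])    # capability benchmark score

# Fit log(score) = log(a) + b * log(compute), i.e. score = a * compute^b.
b, log_a = np.polyfit(np.log(compute), np.log(score), 1)

def predicted_score(c):
    """Extrapolated capability at compute c, under the power-law assumption."""
    return np.exp(log_a) * c ** b

DANGER_THRESHOLD = 0.9  # hypothetical level where sandbagging/danger begins

proposed_compute = 1e24
risky = predicted_score(proposed_compute) >= DANGER_THRESHOLD
print(f"exponent b = {b:.3f}, "
      f"predicted = {predicted_score(proposed_compute):.2f}, risky = {risky}")
```

If `risky` comes out true, the architecture would fail the protocol at that compute budget and approval would be withheld or the budget lowered.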
This is more like “not-obviously not-AGI”.
But besides that, yeah, it seems like an OK starting point for thinking about proper definitions for the purpose of a ban.
Would you define ‘nuclear weapon’ as ‘anything not produced in a way that verifiably could not contain any nuclear material’?
(Keep in mind that this would categorize e.g. a glass of tap water as a nuclear weapon.)
There were no nuclear weapons in 1925; so anything that existed in 1925 is known to not be a nuclear weapon. (Moreover, anything built to a 1925 design isn’t one either.)
The smallest critical mass is greater than 1kg; so anything smaller than 1kg is not a nuclear weapon (though it may be part of one).
Nuclear weapons are made of metal, not wood, cloth, paper, or clay; so anything made of wood, cloth, paper, or clay is not a nuclear weapon. (Thus for instance no conventionally printed book is a nuclear weapon, which is convenient for maintaining freedom of the press.)
A wise man called @Leon Lang told me recently: “a definition that defines something as not being another thing, is flawed”.
Critical masses for nuclear weapons can be determined, at worst at the risk of a nuclear explosion and at best without any risk, by bombarding nuclei with neutrons and studying the energies of the emitted particles and the reaction cross sections.
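This point can be made concrete: once the fission cross section and neutron multiplicity are measured at the bench, a crude one-group diffusion argument already bounds the critical mass well above 1 kg, with no explosion required. The constants below are standard textbook values for U-235; the formula is a deliberate order-of-magnitude sketch, not a real criticality calculation (transport codes give roughly 8.5 cm and ~52 kg for a bare U-235 sphere).

```python
# Order-of-magnitude estimate of the critical radius of a bare U-235
# sphere from measured nuclear data. Crude one-group diffusion model:
# a neutron random-walks one mean free path per fission, and the sphere
# is critical roughly when R ~ mfp / sqrt(nu - 1).
import math

N_A = 6.022e23      # Avogadro's number, 1/mol
rho = 19.1          # density of uranium metal, g/cm^3
A = 235.0           # molar mass of U-235, g/mol
sigma = 1.2e-24     # fast-fission cross section, cm^2 (~1.2 barn)
nu = 2.5            # neutrons released per fission

n = rho * N_A / A                 # nuclei per cm^3
mfp = 1.0 / (n * sigma)           # neutron mean free path, cm
R_c = mfp / math.sqrt(nu - 1.0)   # crude critical radius, cm
M_c = rho * (4.0 / 3.0) * math.pi * R_c**3 / 1000.0  # critical mass, kg

print(f"mean free path ~ {mfp:.1f} cm, R_c ~ {R_c:.1f} cm, M_c ~ {M_c:.0f} kg")
```

Even this back-of-the-envelope version lands orders of magnitude above 1 kg, which is the sense in which the "smaller than 1 kg" exclusion above is verifiable from bench-scale data alone.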
However, we don’t know the critical mass for AGI. An approval protocol could be the equivalent of determining the critical mass for every new architecture; but the accidental creation of an AGI capable of breaching containment, or of being used for AI research, sabotaging alignment work, and aligning the ASI to the AGI’s whims instead of mankind’s ideals, would be the equivalent of a nuclear explosion igniting the Earth’s atmosphere.
P.S. The possibility that a nuclear explosion could ignite the Earth’s atmosphere was considered by scientists working on the Manhattan Project.
Fusion is also a thing. A glass of tap water contains an (admittedly very small) amount of deuterium.