Intelligence solves problems by guiding behavior to produce local extropy. It is indicated by the avoidance of otherwise probable outcomes, which is equivalent to the construction of information.
This amounts to something close to the convergent-instrumental-goals definition: achieving sufficiently specific outcomes requires pursuing convergent instrumental goals.
I like the idea of looking for convergent instrumental goals, but I think this section specifically misses the opportunity to formalize local extropy production, or more generally to look for information-theoretic measures.
If we assume a modeling of an agent in terms of its Markov blanket (ignoring issues with that for now[1]), then we could define the generalized capability of an agent in terms of that:
Capability = I_pred + I_ctrl − β H(I) − S

Where

I_pred – “bits you can see coming”: The mutual information I(I_t; S_{t+1}) between the agent’s internal state I_t and its next sensory state S_{t+1} quantifies how much the agent’s current “belief state” predicts what it will sense next.

I_ctrl – “bits you can steer”: The mutual information I(A_t; E_{t+1}) between the agent’s action A_t and the next external state E_{t+1} measures how much the agent’s outputs causally structure the world beyond its blanket.

H(I) – “bits you have to keep alive”: The Shannon entropy of the internal state I_t, i.e. the size of the agent’s memory in bits. The coefficient β turns that size into a cost, reflecting physical maintenance energy and complexity overhead (e.g. the Landauer limit).

S – “bits you fail to see coming”: The expected negative log-likelihood S = E[−log P(S_{t+1} | I_t)] of the next sensory state given the internal state. This is the “leftover unpredictability” after using the best model encoded in I_t, i.e. the sensory free energy.
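To make the four terms concrete, here is a minimal sketch that estimates each of them from sampled trajectories, assuming discrete states and the empirical distribution as the model encoded in I_t. All function names and the value of β are my own illustrative choices, not anything fixed by the proposal:

```python
# Toy plug-in estimator for Capability = I_pred + I_ctrl - beta*H(I) - S,
# from paired samples of (internal, next-sensory, action, next-external) states.
from collections import Counter
from math import log2

def entropy(samples):
    """Shannon entropy H(X) in bits, estimated from a list of samples."""
    n = len(samples)
    return -sum(c / n * log2(c / n) for c in Counter(samples).values())

def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y), estimated from paired samples."""
    return entropy(xs) + entropy(ys) - entropy(list(zip(xs, ys)))

def capability(internal, sensory_next, actions, external_next, beta=0.1):
    i_pred = mutual_information(internal, sensory_next)   # bits you can see coming
    i_ctrl = mutual_information(actions, external_next)   # bits you can steer
    h_int = entropy(internal)                             # bits you have to keep alive
    # Residual surprise S = H(S_{t+1} | I_t) = H(I_t, S_{t+1}) - H(I_t),
    # i.e. leftover unpredictability under the empirical conditional.
    s = entropy(list(zip(internal, sensory_next))) - entropy(internal)
    return i_pred + i_ctrl - beta * h_int - s
```

For an agent whose one-bit internal state perfectly predicts and steers a one-bit world, I_pred = I_ctrl = 1, S = 0, and capability comes out to 2 − β: prediction and control add bits, memory upkeep subtracts them.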
Instead of requiring hard causal independence across the blanket, it may be possible to define the boundary as the partition of variables that maximizes the separation in mutual information between clusters.
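One naive way to operationalize this soft boundary is a brute-force search over bipartitions that minimizes the total cross-cluster mutual information. This is only a sketch for small systems; the pairwise MI matrix `mi` is assumed given (e.g. estimated as above), and ignores higher-order dependencies:

```python
# Brute-force soft-boundary search: among all bipartitions of n variables,
# pick the one with the least mutual information crossing the cut.
from itertools import combinations

def best_boundary(mi):
    n = len(mi)
    best, best_cost = None, float("inf")
    # Enumerate non-trivial bipartitions; fixing variable 0 on side A
    # avoids counting each bipartition twice.
    for k in range(1, n):
        for rest in combinations(range(1, n), k - 1):
            side_a = {0, *rest}
            side_b = set(range(n)) - side_a
            # Total MI crossing the candidate boundary.
            cost = sum(mi[i][j] for i in side_a for j in side_b)
            if cost < best_cost:
                best, best_cost = (side_a, side_b), cost
    return best, best_cost
```

On a toy MI matrix with two tightly coupled pairs that barely interact, the search recovers the pairs as the two sides of the boundary. The exhaustive enumeration is exponential in n, so anything realistic would need a spectral or greedy relaxation instead.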