Matt Dellago

Karma: 123

Matt Dellago 24 Oct 2025 18:02 UTC
12 points
−2
on: Matthias Dellago’s Shortform
Maximally coherent agents are indistinguishable from point particles. They have no internal degrees of freedom, one cannot probe their internal structure from the outside.

Epistemic Status: Unhinged

Matt Dellago 17 Oct 2025 12:34 UTC
1 point
0
in reply to: johnswentworth’s comment on: Coherence of Caches and Agents
Excellent! Thank you!

Matt Dellago 16 Oct 2025 10:52 UTC
3 points
0
on: Matthias Dellago’s Shortform
Coherence as Purpose

Epistemic Status: Riffing

We know coherence when we see it. A craftsman working versus someone constantly fixing his previous mistakes. A functional organization versus bureaucratic churn. A healthy body versus one fighting itself. War, internal conflict, rework: these are wasteful. We respect people who act decisively, societies that build without tearing down, systems that run clean.

This intuition points somewhere real. In some sense, maximizing/expanding coherence is what the universe does: cutting friction, eliminating waste, building systems that don’t fight themselves. Not from external design, but because coherent systems expand until they can’t. Each pocket of coherence is the universe organizing itself better. The point is that coherence captures “good”: low friction, low conflict, no self-sabotage.

I propose that this is measurable. Coherence could be quantified as thermodynamic efficiency. Pick a boundary and time window, track energy in. The coherent part becomes exported work, heat above ambient, or durable stores (raised water, charged batteries, separated materials). The rest is loss: waste heat, rework, reversals. Systems can expand until efficiency stops generating surplus. When new coordination tools raise that limit, growth resumes. Just observable flows, no goals needed.

An interesting coincidency: maximizing thermodynamic efficiency (coherence) maximally delays heat death of a system. Higher efficiency means slower entropy increase.

I am very interested in hearing counterexamples of coherent systems that are intuitively repellent!

Edit: Had a talk with a physicist: This is in fact the same as the system minimizing entropy production rate! Possibly that this could serve as a more operationally tractable (and fundamental) foundation to agency, as opposed to beliefs, goals, actions, utility etc. An a structure that, if present in a system, minimizes the rate of entropy production. i.e. maximally slows neg-entropy consumption.

Matt Dellago 16 Oct 2025 9:22 UTC
1 point
0
in reply to: Matt Dellago’s comment on: Coherence of Caches and Agents
It would also be quite interesting to look at how coherence scales with system size, and if/when this imposes a limit on growth.

Matt Dellago 16 Oct 2025 9:16 UTC
3 points
0
on: Coherence of Caches and Agents
Great post! A thought: we seem able to intuitively differentiate coherent and incoherent behavior even without knowing the terminal goal. Humans, for instance, visibly differ in how “coherent” they are, which we can infer from local observations alone. My conjecture is that coherence might overlap substantially with thermodynamic efficiency. If behavior is optimal for some terminal value, it must satisfy local Bellman-type consistency (no value loops). I suspect this has a physical parallel: where those local constraints hold tightly, you should see few avoidable losses (high Carnot efficiency); where they fail, you should find loss hotspots (rework, backtracking, waste heat). The local inconsistencies you describe might correspond directly to local inefficiencies, regions of high irreversibility.

Matt Dellago 3 Oct 2025 8:07 UTC
3 points
0
on: Matthias Dellago’s Shortform
Is there an anthropic reason or computational (solomonoff-pilled) argument for why we would expect to the computational/causal graph of the universe to be this local (sparse)? Or at least appear local to a first approximation. (Bells-inequality)

This seems like a quite special property: I suspect that ether
- it is not as rare in e.g. the solomonoff prior as we might first intuit, or
- we should expect this for anthropic resons e.g. it is really hard to develop intelligence/do precidctions in nonlocal universes.

Matt Dellago 7 Sep 2025 17:57 UTC
1 point
0
on: Matthias Dellago’s Shortform
The Red Queen’s Race in Weight Space

In evolution we can tell a story that not only are genes selected for their function, but also for how easily modifiable they are. For example, having a generic antibiotic gene is much more useful than having an antibiotic locked into one target and far, in edit-distance terms, from any other useful variant.

Why would we expect the generic gene to be more common? There is selection pressure on having modifiable genes because environments are constantly shifting (the Red Queen hypothesis). Genes are modules with evolvability baked in by past selection.

Can we make a similar argument for circuits/features/modes in NNs? Obviously it is better to have a more general circuit, but can we also argue that “multitool circuits” are not only better at generalising but also more likely to be found?

SGD does not optimise loss but rather something like free energy, taking degeneracy (multiplicity) into account with some effective temperature.
But evolvability seems distinct from degeneracy. Degeneracy is a property of a single loss landscape, while evolvability is a claim about distribution shift. And the claim is not “I have low loss in the new distribution” but rather “I am very close to a low-loss solution of the new distribution.”

Degeneracy in ML ≈ mutational robustness in biology, which is straightforward, but that is not what I am pointing at here. Evolvability is closer to out-of-distribution adaptivity: the ability to move quickly into a new optimum with small changes.

Are there experiments where a model is trained on a shifting distribution?

Is the shifting distribution relevant or can this just as well be modeled as a mixture of the distributions, and what we think of as OOD is actually in the mixture distribution? In that case degeneracy is all you need.

Related ideas: cryptographic one-way functions (examples of unevolvable designs), out-of-distribution generalisation, mode connectivity.

Matt Dellago 10 Aug 2025 15:39 UTC
4 points
2
on: The Coding Theorem — A Link between Complexity and Probability
Excellent! Great to have a cleanly formulated article to point people to!

Matt Dellago 18 Jul 2025 13:40 UTC
3 points
0
in reply to: Alex_Altair’s comment on: Thermodynamic entropy = Kolmogorov complexity
Good point! My intuition was that the Berkenstein bound (https://en.wikipedia.org/wiki/Bekenstein_bound) limits the amount of information in a volume. (Or more precisely the information surrounded by an area.) Therefore the number of states in a finite volume is also finite.

I must add: since writing this comment, a man called george pointed out to me that, when modeling the universe as a computation one must take care, to not accidentally derive ontological claims from it.

So today I would have a more ‘whatever-works-works’-attitude; UTMs, DFAs both just models, neither likely to be ontologically true.

Matt Dellago 23 May 2025 21:42 UTC
3 points
0
in reply to: djbinder’s comment on: Mirror Organisms Are Not Immune to Predation
Wow, thank you for the kind and thorough reply! Obviously there is much more to this, I’ll have a look at the report

Matt Dellago 9 Apr 2025 15:19 UTC
1 point
0
in reply to: Alexander Gietelink Oldenziel’s comment on: Alexander Gietelink Oldenziel’s Shortform
I first heard this idea from Joscha Bach, and it is my favorite explanation of free will. I have not heard it called as a ‘predictive-generative gap’ before though, which is very well formulated imo

Matt Dellago 21 Mar 2025 13:25 UTC
2 points
0
on: Matthias Dellago’s Shortform
Simplicity Priors are Tautological

Any non-uniform prior inherently encodes a bias toward simplicity. This isn’t an additional assumption we need to make—it falls directly out of the mathematics.

For any hypothesis $h$ , the information content is $I (h) = - l o g (P (h))$ , which means probability and complexity have an exponential relationship: $P (h) = e^{- I (h)}$

This demonstrates that simpler hypotheses (those with lower information content) are automatically assigned higher probabilities. The exponential relationship creates a strong bias toward simplicity without requiring any special mechanisms.

The “simplicity prior” is essentially tautological—more probable things are simple by definition.

Matt Dellago 19 Feb 2025 10:42 UTC
1 point
0
in reply to: Stephen Fowler’s comment on: Thermodynamic entropy = Kolmogorov complexity
I would be interested in seeing those talks, can you maybe share links to these recordings?

Matt Dellago 19 Feb 2025 10:36 UTC
1 point
0
on: Thermodynamic entropy = Kolmogorov complexity
Very good work, thank you for sharing!
Intuitively speaking, the connection between physics and computability arises because the coarse-grained dynamics of our Universe are believed to have computational capabilities equivalent to a universal Turing machine [19–22].
I can see how this is a reasonable and useful assumption, but the universe seems to be finite in both space and time and therefore not a UTM. What convinced you otherwise?

Matt Dellago 14 Feb 2025 10:49 UTC
1 point
0
in reply to: TsviBT’s comment on: Matthias Dellago’s Shortform
Thank you! I’ll have a look!

Matt Dellago 13 Feb 2025 14:43 UTC
11 points
5
on: Matthias Dellago’s Shortform
Simplified the solomonoff prior is the distribution you get when you take a uniform distribution over all strings and feed them to a turing machine.
Since the outputs are also strings: What happens if we iterate this? What is the stationary distribution? Is there even one? The fixed points will be quines, programs that copy their source code to the output. But how are they weighted? By their length? Presumably you can also have quine-cycles of programs that generate each other in turn, in a manner reminiscent metagenesis. Do these quine cycles capture all probability mass or does some diverge?

Very grateful for answers and literature suggestions.

Matt Dellago 11 Feb 2025 21:18 UTC
1 point
0
in reply to: Lucius Bushnaq’s comment on: Proof idea: SLT to AIT
“Many parts of the real world we care about just turn out to be the efficiently predictable.”

I had a dicussion about exactly these ‘pockets of computational reducibility’ today. Whether they are the same as the more vague ‘natural abstractions’, and if there is some observation selection effect going on here.

Matt Dellago 11 Feb 2025 15:43 UTC
4 points
0
on: Proof idea: SLT to AIT
Very nice! Alexander and I were thinking about this after our talk as well. We thought of this in terms of the kolmogorov structure function and I struggled with what you call Claim 3, since the time requirements are only bounded by the busybeaver number. I think if you accept some small divergence it could work, I would be very interested to see.

Matt Dellago 7 Jan 2025 19:31 UTC
3 points
0
in reply to: Alexander Gietelink Oldenziel’s comment on: Alexander Gietelink Oldenziel’s Shortform
Small addendum: The padding argument gives a lower bound of the multiplicity. Above it is bounded by the Kraft-McMillan inequality.

Matt Dellago 7 Jan 2025 15:45 UTC
5 points
2
in reply to: Alexander Gietelink Oldenziel’s comment on: Alexander Gietelink Oldenziel’s Shortform
Interesting! I think the problem is dense/compressed information can be represented in ways in which it is not easily retrievable for a certain decoder. The standard model written in Chinese is a very compressed representation of human knowledge of the universe and completely inscrutable to me.
Or take some maximally compressed code and pass it through a permutation. The information content is obviously the same but it is illegible until you reverse the permutation.

In some ways it is uniquely easy to do this to codes with maximal entropy because per definition it will be impossible to detect a pattern and recover a readable explanation.

In some ways the compressibility of NNs is a proof that a simple model exists, without revealing a understandable explanation.

I think we can have (almost) minimal yet readable model without exponentially decreasing information density as required by LDCs.

Matt Dellago

Coherence as Purpose

The Red Queen’s Race in Weight Space