I had a kinda different take (copied from twitter)
This tweet is an interesting argument, but I think a central flaw is that it ignores how long the program has to run to produce the prediction.
For example, "AlphaZero-after-1-game-of-self-play" and "AlphaZero-after-10²⁰-games-of-self-play" have essentially the same Kolmogorov complexity. After all, they have the exact same source code, apart from like 4 characters that specify the number of games to play. But there's a real sense in which the latter is better at Go than the former. Specifically, it's better in the sense of "I don't want to sit around while it does 10²⁰ games of self-play, I want to play Go right now."
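A toy way to see the gap (my own illustration, using description length in characters as a crude stand-in for Kolmogorov complexity):

```python
# Two "programs" that differ only in the number of self-play games.
# Their description lengths (a crude stand-in for Kolmogorov
# complexity) are nearly identical, but their runtimes are not.

program_1_game = "train(games=1)"
program_many_games = "train(games=10**20)"

# Description length differs by just a few characters...
print(len(program_1_game))      # 14
print(len(program_many_games))  # 19

# ...but the runtime differs by a factor of ~10**20. Even at one
# microsecond per game, the second program needs millions of years.
seconds_per_game = 1e-6
years = seconds_per_game * 10**20 / (3600 * 24 * 365)
print(years)  # roughly 3 million years
```

The point is just that "how hard is this program to write down" and "how long until it can actually play Go" come apart completely.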
Another way to think about this argument: Suppose AI_1 builds AI_2, and then AI_2 does X. Well, it's also true that "AI_1 did X": specifically, "AI_1 did X" by building AI_2.
In a certain sense, this is true! But that's a pretty weird way to think about things! AI_2 does in fact exist here!
K-complexity asks us to forget about AI_2, by focusing the discussion on what happens given infinite time, as opposed to how it happens and how long it takes.
I think the core true point in atroyn's argument is: there is a-priori-unpredictable complexity in the world that can't be deduced from an armchair, but rather has to be observed, and making a "more intelligent successor" does not substitute for that.
If you flip a coin and don't tell me, then I don't know whether it's heads or tails. And I also can't make a "more intelligent successor" that knows whether it's heads or tails.
This is entirely true! But I claim people talking about recursive self-improvement are not making that mistake.
For example:
There's an "overhang" of possible logical inferences that an AI could make on the basis of its existing knowledge, but doesn't (e.g. if I tell an AI the axioms of math, it doesn't instantaneously prove every possible theorem),
There's an "overhang" of possible input data that an AI could download and scrutinize, but doesn't (e.g. as of this writing, I believe no AI has watched all 100,000 years of YouTube),
There's an "overhang" of possible plans that an AI could execute but doesn't (e.g. an early AGI is unlikely to be simultaneously doing every remote job on the planet, while also starting a zillion new ambitious projects in parallel).
So an AI could self-improve in a way that allows it to go farther and faster on those metrics.
An obvious example is tweaking the assembly code to make the same AI run faster.
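In that spirit, here's a toy sketch (my own illustration, in Python rather than assembly, with memoization standing in for low-level tweaks): two functions with identical input-output behavior, where the "improved" one simply runs much faster.

```python
from functools import lru_cache

# Naive recursive Fibonacci: exponentially slow, but its
# input-output behavior defines what the program "can do".
def fib_slow(n):
    if n < 2:
        return n
    return fib_slow(n - 1) + fib_slow(n - 2)

# "Self-improved" version: identical behavior, dramatically faster.
# Capability-in-the-limit is unchanged; capability-right-now is not.
@lru_cache(maxsize=None)
def fib_fast(n):
    if n < 2:
        return n
    return fib_fast(n - 1) + fib_fast(n - 2)

assert fib_slow(20) == fib_fast(20) == 6765
```

By any complexity-of-description measure these are nearly the same program, but only one of them is usable at large n.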
I also want to put self-replication into this category: going from "one instance of an AI" to "a million instances of the same AI running in parallel and collaborating" (e.g. by buying or stealing additional compute). If you think about it, I claim that should totally count as "self-improvement", because after all one AI system is creating a more powerful AI "system". The latter "system" is composed of many instances of the original AI, but so what? It should still count, IMO.
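A toy sketch of that framing (my own illustration, with worker threads standing in for extra AI instances): one job split across several identical workers, whose merged output is the "system's" answer. Note that CPython threads don't actually speed up CPU-bound work because of the GIL; a real speedup would need processes or separate machines, but the structure is the same.

```python
from concurrent.futures import ThreadPoolExecutor

def is_prime(n):
    if n < 2:
        return False
    return all(n % d for d in range(2, int(n**0.5) + 1))

def count_primes(chunk):
    # One "instance" handles one slice of the overall problem.
    lo, hi = chunk
    return sum(1 for n in range(lo, hi) if is_prime(n))

# Split one big job across four identical workers and merge results.
chunks = [(i, i + 25_000) for i in range(0, 100_000, 25_000)]
with ThreadPoolExecutor(max_workers=4) as pool:
    total = sum(pool.map(count_primes, chunks))

print(total)  # 9592 primes below 100,000
```

No single worker got any smarter, but the "system" of four workers finishes the job; that's the sense in which replication counts as improvement.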
Then after sleeping on it I tweeted again:
I think the OP here is also valid (and complementary).
Each of these points looks valid, but there's a much simpler refutation: « Any good enough intelligence is smart enough to distribute part of its cognition to external devices. »
Application: either my code includes Wikipedia and whoever might change Wikipedia just before I consult it, or its Kolmogorov complexity does not fully capture my capabilities. In a sense, this shows the impact of putting too much confidence in a debatable picture of our capabilities and limitations as a single agent working from some cockpit.