Yudkowsky mainly wrote about recursive self-improvement from a perspective in which algorithms were the most important factors in AI progress—e.g. the brain in a box in a basement which redesigns its way to superintelligence.
Sometimes when explaining the argument, though, he switched to a perspective in which compute was the main consideration—e.g. when he talked about getting “a hyperexponential explosion out of Moore’s Law once the researchers are running on computers”.
What does recursive self-improvement look like when you think that data might be the limiting factor? It seems to me that it looks a lot like iterated amplification: using less intelligent AIs to provide a training signal for more intelligent AIs.
I don’t consider this a good reason to worry about IA, though: in a world where data is the main limiting factor, recursive approaches to generating it still seem much safer than alternatives.
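To make the shape of that loop concrete, here is a toy sketch in Python. Everything in it is a made-up stand-in (the digit-summing task, the `ToyModel` class, memoisation standing in for distillation); it only illustrates the structure of a weaker system producing training labels for a stronger successor, not any actual implementation of iterated amplification.

```python
import random

# Toy amplification-as-data-generation loop. The "task" is summing a list of
# digits, the base model is only reliable on short lists, "amplification"
# decomposes long lists into short ones and combines the weak model's answers,
# and "distillation" is plain memoisation (in the real story it would be SGD
# on the generated labels).

class ToyModel:
    def __init__(self, max_len=4, memory=None):
        self.max_len = max_len        # how long an input it can handle directly
        self.memory = memory or {}    # answers absorbed from distillation

    def answer(self, xs):
        key = tuple(xs)
        if key in self.memory:
            return self.memory[key]
        if len(xs) <= self.max_len:
            return sum(xs)            # reliable only on short inputs
        return 0                      # fails badly on long inputs

def amplify(model, xs):
    # Extra structure at inference time: split the problem, call the weak
    # model on the pieces, combine. The amplified system beats a single call
    # to the model on long inputs.
    if len(xs) <= model.max_len:
        return model.answer(xs)
    mid = len(xs) // 2
    return amplify(model, xs[:mid]) + amplify(model, xs[mid:])

def distill(model, dataset):
    # The successor is "trained" on labels that the weaker system produced.
    memory = dict(model.memory)
    memory.update({tuple(x): y for x, y in dataset})
    return ToyModel(model.max_len, memory)

model = ToyModel()
for _ in range(3):
    batch = [[random.randint(0, 9) for _ in range(16)] for _ in range(100)]
    labelled = [(x, amplify(model, x)) for x in batch]
    model = distill(model, labelled)

print(model.answer(batch[0]) == sum(batch[0]))  # True: the successor now answers directly
```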
Perhaps a data-limited intelligence explosion is analogous to what we humans do all the time when we teach ourselves something. Out of the vast sea of information on the internet, we go get some data, study it, and then use what we have learned to form a better opinion about what data we need next, and repeat until we are at the forefront of the world’s knowledge. We start from scratch, with a vague understanding like “I should learn more economics, I don’t even know what supply and demand are”, and we end up publishing a paper on auction theory or something. This is a recursive self-improvement loop in data quality, so to speak, rather than data quantity.
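Read as machine learning, that loop looks a lot like active learning: the current state of knowledge decides which piece of data is worth studying next, and studying it improves the state of knowledge that makes the next choice. A minimal sketch, with every concrete detail (the hidden threshold, the pool of “documents”, the selection rule) invented for illustration:

```python
import random

# The self-teaching loop as crude active learning: what you already believe
# determines which item you study, and studying it sharpens what you believe.

random.seed(0)
TRUE_THRESHOLD = 0.62                            # a fact sitting "out there" in the data

def study(x):
    # Consulting the sea of information: is x above the hidden threshold?
    return x >= TRUE_THRESHOLD

pool = [random.random() for _ in range(10_000)]  # everything we could read
low, high = 0.0, 1.0                             # initial vague understanding

for step in range(12):
    # Pick the most informative item given what we currently believe.
    midpoint = (low + high) / 2
    x = min(pool, key=lambda p: abs(p - midpoint))
    if study(x):
        high = x
    else:
        low = x
    print(f"step {step}: threshold somewhere in [{low:.3f}, {high:.3f}]")
```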
What counts as self-improvement in the scenario governed by data?
You can grab the whole internet, including Sci-Hub and Library Genesis, and then maybe hack all “smart” appliances worldwide… and after that, I guess you need to construct some machines that will perform experiments for you.
But none of this improves the machine’s “self”. With algorithms, the idea is that the machine would replace its own algorithms with better ones, once it gets the ability to invent and evaluate algorithms. With hardware, the idea is that the machine would replace its own hardware with faster hardware, once it gets the ability to design and produce hardware. But replacing your data with better data… that is not something we usually call self-improvement.
Also, what kind of data are we talking about? Data about the real world has to come from the outside, by definition. (Unless it is data about physics that you can obtain by observing the physical properties of your own circuits, or something like that.) But there is also data in the sense of precomputed cached results, like playing zillions of games of chess against yourself and remembering which strategies were most successful. If this were the limiting factor… I guess it would be something like a bounded AIXI which hypothetically already has enough hardware to simulate a universe, and only needs to make zillions of computations to find the one that is consistent with the observed data.
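Here is a toy version of that last picture, in which the bottleneck is compute spent checking candidate “worlds” against already-observed data rather than gathering new data. The tiny family of candidate models below is, of course, invented purely for illustration:

```python
from itertools import product

observations = [3, 5, 7, 9]            # everything the agent has observed so far

def world_model(a, b):
    # A candidate "universe": the sequence x_t = a*t + b.
    return lambda t: a * t + b

def consistent(model, obs):
    return all(model(t) == x for t, x in enumerate(obs))

# The "zillions of computations": brute-force every candidate world and keep
# the ones consistent with the observed data. No new outside data is needed.
survivors = [world_model(a, b)
             for a, b in product(range(-20, 21), repeat=2)
             if consistent(world_model(a, b), observations)]

# Predict by running the surviving worlds forward.
print({m(len(observations)) for m in survivors})   # {11}: one world in this family fits
```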
In the scenario governed by data, the part that counts as self-improvement is where the AI puts itself through a process of optimisation by stochastic gradient descent with respect to that data.
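In other words, the “self” that gets improved is just the parameters, and the improvement mechanism is an ordinary gradient step with respect to whatever data has been gathered. A minimal sketch, with a toy one-parameter model and made-up data:

```python
import random

# "Self-improvement = SGD with respect to the data": the system's "self" is
# its parameters, and each update nudges them to fit the data better.

random.seed(1)

def example():
    x = random.uniform(-1, 1)
    return x, 1.5 * x + random.gauss(0, 0.05)   # noisy observations of a pattern

data = [example() for _ in range(500)]          # the gathered data, however it was obtained

w = 0.0                                         # the "self": a single weight

for _ in range(1000):
    x, y = random.choice(data)                  # stochastic: one example at a time
    grad = 2 * (w * x - y) * x                  # gradient of the squared error
    w -= 0.1 * grad                             # the self-modification step

print(round(w, 2))                              # w ends up near the slope present in the data
```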
You don’t need that much hardware for data to be a bottleneck. For example, I think that there are plenty of economically valuable tasks that are easier to learn than StarCraft. But we get StarCraft AIs instead because games are the only kind of task where we can generate arbitrarily large amounts of data.
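To illustrate why games are special here: given a simulator, labelled examples cost nothing but compute, which is exactly the property most economically valuable tasks lack. The “game” below is trivial and invented; any real simulator plays the same role:

```python
import random

def random_playout():
    # A trivial stand-in for a game simulator: two players accumulate random
    # scores, and the label is who ends up ahead. State in, outcome out, at
    # zero marginal cost beyond compute.
    a = sum(random.random() for _ in range(10))
    b = sum(random.random() for _ in range(10))
    return (a, b), int(a > b)

# Arbitrarily large amounts of training data, limited only by how long we run:
dataset = [random_playout() for _ in range(100_000)]
print(len(dataset), dataset[:2])
```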