For a dissenting view, there’s e.g. jacob_cannell’s recent comment about the implications of AlphaGo.
Critics like to point out that DL requires tons of data, but so does the human brain. A more accurate comparison requires quantifying the dataset a human pro Go player trains on.
A 30-year-old Asian pro will have perhaps 40,000 hours of playing experience (20 years × 50 weeks/year × 40 hrs/week). The average game lasts perhaps an hour and consists of about 200 moves. In addition, pros (and even fans) study published games; reading through a game takes less time, perhaps as little as 5 minutes.
So we can estimate very roughly that a top pro will have absorbed between 100,000 and 1 million games, and between 20 and 200 million individual positions (at around 200 moves per game).
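The arithmetic, spelled out (these are the comment's own rough assumptions, not measured figures):

    # Back-of-envelope estimate using the numbers above (all assumptions).
    playing_hours = 20 * 50 * 40          # 20 years x 50 weeks x 40 hrs/week = 40,000
    moves_per_game = 200

    games_low, games_high = 100_000, 1_000_000      # played plus studied games
    positions_low = games_low * moves_per_game      # 20 million positions
    positions_high = games_high * moves_per_game    # 200 million positions

    # AlphaGo's supervised training set (KGS): 160,000 games, ~29M positions,
    # which sits inside the human range estimated above.
    kgs_games, kgs_positions = 160_000, 29_000_000
    print(positions_low <= kgs_positions <= positions_high)  # True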
AlphaGo was trained on the KGS dataset: 160,000 games and 29 million positions. So it did not train on significantly more data than a human pro; the data quantities are actually very similar.
Furthermore, the human pro’s dataset is arguably of higher quality, since they will be familiar with mainly pro-level games, whereas AlphaGo’s dataset is mostly amateur level.
The main difference is speed. The human brain’s ‘clockrate’ or equivalent is about 100 Hz, whereas AlphaGo’s various CNNs can run at roughly 1,000 Hz during training on a single machine, and at perhaps a 10,000 Hz equivalent distributed across hundreds of machines. 40,000 hours of experience, a lifetime’s worth, can thus be compressed 100x or more into just a couple of weeks for a machine. This is the key lesson here.
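As a sanity check on the ‘couple of weeks’ figure, using the comment’s own rates:

    brain_hz = 100               # rough brain 'clockrate' per the comment
    cluster_hz = 10_000          # distributed training equivalent, per the comment
    speedup = cluster_hz // brain_hz        # 100x
    wall_clock_hours = 40_000 / speedup     # 400 hours
    print(wall_clock_hours / 24)            # ~16.7 days, i.e. about 2.4 weeks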
That’s a terrible argument. AlphaGo represents a general approach to AI, but its instantiation on the specific problem of Go tightly constrains the problem domain and solution space. Real life is far more combinatorial still, and an AGI requires much more expensive meta-level repeated cognition as well. You don’t just solve one problem, you also look back at all past solved problems and think about how you could have solved those better; revisiting all n-1 earlier problems after solving the n-th makes the total meta-level work grow roughly as n²/2. That’s quadratic blowup.

Tl;Dr speed of narrow AI != speed of general AI.
But what if a general AI could generate specialized narrow AIs? That is something the human brain cannot do but an AGI could. Thus speed of general AI = speed of narrow AI + time to specialize.
How is it different than a general AI solving the problems by itself?

It isn’t. At least not in my model of what an AI is. But Mark_Friedenbach seems to operate under a model where this is less clear, or where the consequences of an AI being able to create these kinds of specialized sub-agents are not sufficiently taken into account.
AlphaGo represents a general approach to AI, but its instantiation on the specific problem of Go tightly constrains the problem domain and solution space...
Sure, but that wasn’t my point. I was addressing key questions of training data size, sample efficiency, and learning speed. At least for Go, vision, and related domains, the sample efficiency of DL-based systems appears to be approaching that of humans. The net learning efficiency of the brain is far beyond current DL systems in terms of learning per joule, but the gap in terms of learning per dollar is smaller, and closing quickly. Machine DL systems also easily and typically run 10x or more faster than the brain, and thus learn/train 10x faster.
Although I disagree that fooming will be slow, from what I’ve learned studying AlphaGo I would say that its approach is not easy to generalize. AlphaGo draws its power partly from the step where an ‘intuitive’ neural net is created, trained on millions of self-play games from another net that had already been trained by supervised learning. But that training can be accurate because the end positions and the winning player are clearly defined once the game is over. This allows a precise calculation of the outcome function that the intuitive neural net is trying to learn. Unsupervised learners interacting with an environment that has open ontologies will have a much harder time coming up with this kind of intuition-building step.
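A minimal sketch of that last point (a hypothetical helper, not AlphaGo’s actual code): once a Go game ends, every position in it gets an exact, rule-defined label, which is what makes the ‘intuitive’ net’s training target so clean.

    # Hypothetical sketch: labeling self-play positions with the final outcome.
    # 'winner' is +1 (black won) or -1 (white won), known exactly at game end.
    def value_targets(game_positions, winner):
        # Every position inherits the precise final outcome as its training
        # label; no open-ended ontology or noisy reward signal is involved.
        return [(position, winner) for position in game_positions]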