I don’t think any one person understands the Linux kernel anymore. It’s just too big. Same with modern CPUs.
An RNN is something that one person can create and then fail to understand. That’s not like the Linux kernel at all.
Correction: An RNN is something that a person working with a powerful general optimizer can create and then fail to understand.
A human without the optimizer can create RNNs by hand—but only of the small and simple variety.
Although the Linux kernel and modern CPUs are piecewise-understandable, whereas neural networks are not.
Lots of neural networks at an individual vertex level are a logistic regression model, or something similar, -- I think I understand those pretty well. Similarly: “I think I understand 16-bit adders pretty well.”
I don’t think any one person understands the Linux kernel anymore. It’s just too big. Same with modern CPUs.
An RNN is something that one person can create and then fail to understand. That’s not like the Linux kernel at all.
Correction: An RNN is something that a person working with a powerful general optimizer can create and then fail to understand.
A human without the optimizer can create RNNs by hand—but only of the small and simple variety.
Although the Linux kernel and modern CPUs are piecewise-understandable, whereas neural networks are not.
Lots of neural networks at an individual vertex level are a logistic regression model, or something similar, -- I think I understand those pretty well. Similarly: “I think I understand 16-bit adders pretty well.”