I think trying to create intelligence via gradient descent is a dead end, so we’ll have to switch to an entirely different and more expensive architecture. I’ll probably write a post about that soon.