Thanks for your reply!
A few points where clarification would help, if you don’t mind (feel free to skip some):
What are the capabilities of the “generative model”? In general, the term seems to be used in various ways. e.g.
Sampling from the learned distribution (analogous to GPT-3 at temp=1)
Evaluating the probability of a given point
Producing the predicted most likely point (analogous to GPT-3 at temp=0)
Is what we’re predicting the input at the next time step? (Sometimes “predict” can be used to mean filling in missing information, but that doesn’t seem to make sense in this context.) Also, I’m not sure what I mean by “time step” here.
The “input signal” here is coming from whatever is wired into the cortex, right? Does it work to think of this as a vector in ?
Is the contextual information just whatever is the current input, plus whatever signals are still bouncing around?
Also, the capability described may be a bit too broad, since there are some predictions that the cortex seems to be bad at. Consider predicting the sum of two 8-digit integers. Digital computers compute that easily, so it’s fundamentally an easy problem, but for humans to do it requires effort. Yet for some other predictions, the cortex easily outperforms today’s digital computers. What characterizes the prediction problems that the cortex does well?
I find myself returning to this because the idea of a “common cortical algorithm” is intriguing.
It seems to me that if there is a “common cortical algorithm” then there is also a “common cortical problem” that it solves. I suspect it would be useful to understand what this problem is.
(As an example of why isolating the algorithm and problem could be quite different, consider linear programming. To solve a linear programming problem, you can choose a simplex algorithm or an interior-point method, and these are fundamentally different approaches that are both viable. It’s also quite a bit easier to state linear programming as a problem than it is to describe either solution approach.)
Do you have a view on the most plausible candidates for a “common cortical problem” (CCP)? The tricky aspects that come to mind: not being too narrow (i.e. the CCP should include (almost) everything the CCA can do), not being too broad (i.e. the CCA should be able solve (almost) every instance of the CCP), and not being too vague (ideally precise enough that you could actually make a benchmark test suite to evaluate proposed solutions).