So I guess the point then more becomes about general open source development of other countries where China is part of it and that people did not correctly predict this as something that would happen.
Something like distillation techniques for LLMs would be used by other countries and then profilerated and that the rationality community as a whole did not take this into account?
I’ll agree with you that Bayes points should be lost in prediction of theory of mind of nation states, it is quite clear that they would be interested in this from a macro-analysis perspective (I say in hindsight of course.)
I’m not sure that Deepseek is SOTA in terms of inherent development, it seems to me that they’re using some of the existing work from OpenAI, Deepmind & Anthropic but I might be wrong here, is there anything else that you’re pointing at?
There are many math and coding benchmarks where models from DeepSeek, Ali baba and tencent are now leading, and definitely leading what was SOTA a year ago. If you don’t want to take my word for it I can dig them up.
No, we good. I was just operating under the assumption that deepseek was just doing distilling of OpenAI but it doesnt seem to be the only good ML company from China. There’s also a bunch of really good ML researchers from China so I agree at this point.
So I guess the point then more becomes about general open source development of other countries where China is part of it and that people did not correctly predict this as something that would happen.
Something like distillation techniques for LLMs would be used by other countries and then profilerated and that the rationality community as a whole did not take this into account?
I’ll agree with you that Bayes points should be lost in prediction of theory of mind of nation states, it is quite clear that they would be interested in this from a macro-analysis perspective (I say in hindsight of course.)
I’m not sure that Deepseek is SOTA in terms of inherent development, it seems to me that they’re using some of the existing work from OpenAI, Deepmind & Anthropic but I might be wrong here, is there anything else that you’re pointing at?
There are many math and coding benchmarks where models from DeepSeek, Ali baba and tencent are now leading, and definitely leading what was SOTA a year ago. If you don’t want to take my word for it I can dig them up.
No, we good. I was just operating under the assumption that deepseek was just doing distilling of OpenAI but it doesnt seem to be the only good ML company from China. There’s also a bunch of really good ML researchers from China so I agree at this point.