Human languages are regularized because we have to talk to other humans.
But even so, humans develop new languages over time because of random drift.
I agree the language-mixing and some of the shorthand are not necessarily reasons to predict LLMs creating new languages. But there’s other evidence. Like the disclaim overshadow stuff. Or just the fact that you can very easily cook your RL and make agents produce gibberish CoTs while maintaining task performance (narrow task performance, I think this blows up the models overall capabilities).
Human languages are regularized because we have to talk to other humans.
But even so, humans develop new languages over time because of random drift.
I agree the language-mixing and some of the shorthand are not necessarily reasons to predict LLMs creating new languages. But there’s other evidence. Like the disclaim overshadow stuff. Or just the fact that you can very easily cook your RL and make agents produce gibberish CoTs while maintaining task performance (narrow task performance, I think this blows up the models overall capabilities).