Humans are hilariously bad at wilderness survival in the absence of societal knowledge and support. The support doesn’t need to be 21st-century-shaped, but we do need both physical and social technology to survive and reproduce reliably.
That doesn’t matter much, though, because humans live in an environment which contains human civilization. The “holes” in our capabilities don’t come up very often.
The right tools could also paper over many of the deficiencies of LLM agents. I don’t expect the tools that let groups of LLM agents collectively do impressive things to result in particularly human-shaped agents, though.
Concretely, sample efficiency is very important if you want a human-like agent that can learn on the job in a reasonable amount of time. It’s much less important if you can train once on how to complete each task with a standardized set of tools, and then copy the trained narrow system around as needed.
(Note: perhaps I should say “language-capable agent” rather than “LLM-based agent”.)
My point is more that we have millennia of experience building tools and social structures that let humans successfully accomplish tasks, and maybe two years of experience building tools and structures that let LLM agents do the same.
I do agree that there’s some difference in generality, but I expect that if we had spent millennia gathering experience building tools and structures tailored towards making LLMs more effective, the generality failures of LLMs would look a lot less crippling.
If you take a bunch of LLMs and try to get them to collaboratively build a 1 GW power plant, they are going to fail mostly in ways like:
- they have hilariously poor vision
- they don’t make effective use of new tools
- they don’t create new tools to trivialize repetitive tasks
- they get caught in loops of trying the same ineffective thing over and over
All of these are failure modes which can be substantially mitigated by better scaffolding of the sort that is hard to design in one shot but easy to iteratively improve over time.
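To make "scaffolding" a bit more concrete, here is a minimal sketch of one piece that targets the last failure mode (retrying the same ineffective thing). Everything in it is a hypothetical illustration, not an existing library: the `LoopGuard` name, the `max_repeats` threshold, and the idea that an action can be represented as a plain string are all assumptions.

```python
from collections import Counter


class LoopGuard:
    """Hypothetical helper: tracks how often each action has failed in a row."""

    def __init__(self, max_repeats: int = 3):
        # Assumed threshold; a real scaffold would tune this over time.
        self.max_repeats = max_repeats
        self.failures = Counter()

    @staticmethod
    def _key(action: str) -> str:
        # Crude normalization so trivially different retries count as the same action.
        return " ".join(action.lower().split())

    def record(self, action: str, succeeded: bool) -> None:
        key = self._key(action)
        if succeeded:
            # A success clears the failure history for this action.
            self.failures.pop(key, None)
        else:
            self.failures[key] += 1

    def is_stuck(self, action: str) -> bool:
        # True once the same action has failed max_repeats times without a success.
        return self.failures[self._key(action)] >= self.max_repeats
```

A scaffold might call `is_stuck` before running an agent's proposed action and, when it returns `True`, force the agent to try something else or escalate. It is the kind of thing that is hard to get right in one shot (what counts as "the same" action?) but easy to tighten up after watching a few failures.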