My understanding is that there was a separate image model in historical VLMs like Flamingo, but that it passed on a vector representation of the image, not text.
I understood, very much secondhand, that current LLMs are still using a separately trained part of the model’s input space for images. I’m very unsure how the model weights are integrating the different types of thinking, but am by default skeptical that it integrates cleanly into other parts of reasoning.
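To make the pattern concrete, here is a minimal sketch of how that kind of architecture is usually described: a separately trained vision encoder turns an image into a sequence of continuous patch embeddings, a learned projection (here a plain matrix; real systems may use an MLP or cross-attention) maps those into the LLM's token-embedding space, and the LLM consumes the result interleaved with ordinary text embeddings. All dimensions and weights below are made-up placeholders for illustration, not any real model's.

```python
import numpy as np

# Hypothetical, tiny dimensions for illustration; real models are far larger.
NUM_PATCHES = 16   # image patches produced by the vision encoder
VISION_DIM = 32    # vision encoder's output dimension
LLM_DIM = 64       # the LLM's token-embedding dimension

rng = np.random.default_rng(0)

def vision_encoder(image: np.ndarray) -> np.ndarray:
    """Stand-in for a separately trained encoder (e.g. a ViT):
    maps an image to a sequence of patch embeddings, not to text."""
    # A fixed random projection stands in for learned weights.
    w = rng.normal(size=(image.size, NUM_PATCHES * VISION_DIM))
    return (image.reshape(1, -1) @ w).reshape(NUM_PATCHES, VISION_DIM)

# The adapter that maps vision vectors into the LLM's input space.
# In real systems this is a learned projection or resampler module.
projection = rng.normal(size=(VISION_DIM, LLM_DIM))

def embed_image(image: np.ndarray) -> np.ndarray:
    # (NUM_PATCHES, VISION_DIM) @ (VISION_DIM, LLM_DIM) -> (NUM_PATCHES, LLM_DIM)
    return vision_encoder(image) @ projection

# Text tokens are embedded by the LLM's own embedding table (mocked here).
text_embeddings = rng.normal(size=(5, LLM_DIM))  # e.g. 5 text tokens

image = rng.normal(size=(8, 8))
image_embeddings = embed_image(image)

# The LLM sees one interleaved sequence of continuous vectors --
# the image never becomes text on the way in.
llm_input = np.concatenate([image_embeddings, text_embeddings], axis=0)
print(llm_input.shape)  # (21, 64): 16 image "tokens" + 5 text tokens
```

The point of the sketch is the data flow: the image side contributes vectors directly into the same sequence the transformer processes, which is why how well that representation "integrates" with the rest of the model's reasoning is an empirical question rather than something guaranteed by the architecture.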
That said, I’m also skeptical that this is fundamentally a hard part of the problem, as simulation and generated data seem like a very tractable route to improving this, if/once model developers see it as a critical bottleneck for tens of billions of dollars in revenue.