One could hook up a language model to decide what to visualize, Sora to generate visualizations, and a vision model to extract outcomes. This seems like around 40% of what intelligence is—the only thing I don’t really see is how reward should be “plugged in,” but there may be naive ways to set goals.
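The proposed wiring can be sketched as a plan → render → read-off loop. Everything here is hypothetical: `plan_scene`, `generate_video`, and `extract_outcome` are placeholder stubs standing in for the language model, Sora, and the vision model respectively, not real APIs.

```python
def plan_scene(goal: str) -> str:
    """Stand-in for a language model deciding what to visualize."""
    return f"scene depicting: {goal}"


def generate_video(scene_description: str) -> str:
    """Stand-in for Sora rendering the imagined scene as video."""
    return f"video({scene_description})"


def extract_outcome(video: str) -> str:
    """Stand-in for a vision model reading the outcome off the video."""
    return f"outcome of {video}"


def imagine(goal: str) -> str:
    """One pass of the visualize-then-extract loop described above."""
    scene = plan_scene(goal)
    video = generate_video(scene)
    return extract_outcome(video)
```

The open question in the text, how reward plugs in, would sit around this loop: something would have to score `extract_outcome`'s output against the goal and feed that back into `plan_scene`.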