The Takeaway: Yann LeCun thinks the future belongs to world models that plan, not chatbots that predict tokens.
- LLMs are “great… for what they do,” but he says they’re a dead end for human- or animal-like intelligence.
- His bet is contrarian: stop generating pixels and start learning abstract representations that predict the consequences of actions.
- He sees the real prize in data efficiency — a system that can learn a new task like a teenager learning to drive, not by hoovering up millions of demos.
LeCun, the Meta AI veteran and Turing Award winner, has now spun out Ami Labs to push what he calls “AI for the real world.” His argument is blunt: language is special, but reality is messier — continuous, noisy, high-dimensional — and that’s where current architectures break. He points to JEPA-style systems as the better path because they learn representations by predicting one view from another, rather than reconstructing every pixel. In his view, that shift matters because intelligence isn’t about regurgitating inputs; it’s about anticipating outcomes and choosing actions through search.
That’s also why he’s skeptical of vision-language-action models and imitation-heavy robotics. They can look impressive, but they’re brittle and data-hungry. “Why can’t a 17-year-old learn to drive in twenty hours?” he asks. If a machine needs endless demonstrations for every new task, it’s not really generalizing. His near-term targets are industrial control, robotics, and other complex systems like jet engines, power plants, and even patient modeling. The long game is bigger: “what we’re designing are systems that are capable of thinking.”