Tesla Kills Dojo: Musk Just Ripped Out His Own Kidney to Bet on World Models

I’ll admit it: when Musk suddenly axed the entire Dojo team my first reaction was “fake news.” This wasn’t some side hustle; it was the multi-year, multi-billion-dollar “favorite son” hyped every earnings call. Picture a man at the peak of mid-life success calmly slicing out his own kidney—that’s the level of pain we’re talking about.
But this isn’t a simple strategic retreat or a dumb write-off. It’s a brutal, deliberate, utterly rational sacrifice. The blade Musk just swung didn’t merely kill a home-grown chip that’s four generations behind Nvidia and stuck in an evolutionary cul-de-sac. It announced to the whole industry: the old god of pure LLMs is dying, and a new deity—the World Model—is about to descend. Tesla, or rather Musk’s entire AI empire, is undergoing a full-brain transplant, a complete decoupling of mind and body.
The Twilight of LLMs
Let’s talk about the sacrificed god. Face it: large language models are topping out. However loud the GPT-5 headlines are, the jump feels nothing like 3.5 → 4; against Grok or Claude it only inches ahead. The culprit is architectural.
A killer meme called the “six-finger test” exposes the软肋:
Show GPT-5 a Photoshopped hand with six fingers and it will swear on a stack of transformers that you’re looking at five. It will even circle the “correct” five, treating the extra digit as background noise. This isn’t stupidity; it’s a worldview bug. LLMs learn from text, not reality. “Humans have five fingers” is scripture, and when pixels contradict scripture, scripture wins.
More data, more compute—none of that fixes it.
Purely linguistic, two-dimensional compression can’t cross the chasm to three-dimensional physics. As robotics founder Wang Xingxing says, pretrained data is useless on the robot side; as Fei-Fei Li puts it, “The world is 3-D.” Language is just a flat shadow cast by a solid reality.
World Model Arrives
While LLMs chase shadows, Google DeepMind’s Genie 3 cracked open the 3-D door. Instead of reading words it watched oceans of video and, unsupervised, bootstrapped an interactive, physics-obedient, spatiotemporally consistent virtual world.
This is qualitatively different—AlphaZero for reality. Freed from human language labels, Genie 3 learns straight from pixels. Scribbles on a wall stay there when you turn around; it streams 24 fps for over a minute, every frame back-tracked for perfect physical coherence.
That’s the future of embodied AI. Yesterday we threw robots into the real world to rack up expensive collisions; tomorrow we spawn millions of agents inside a world model and train 24/7 at near-zero marginal cost.
Which is why Musk is switching brains: let X.AI and Nvidia handle the pre-training “brain,” while millions of Teslas and Optimuses become the data-gathering “body.” Mind takes advanced math, body takes combat sports—division of labor, maximum efficiency.
Dao vs. Shu
Dojo’s death isn’t a Tesla sideshow; it’s the moment the whole industry pivots from “shu” (tactics) to “dao” (path). Yesterday we competed on who had more parameters and bigger GPUs—pure tactics. Tomorrow the schism is philosophical: are you doubling down on 2-D language idols or worshipping at the church of 3-D reality?
Musk paid a kidney to answer. He’s done chasing Nvidia in a losing silicon race and is betting everything on the new dao: a body that continuously harvests high-quality real-world data, paired with the best off-the-shelf brain money can buy. Smarter, cheaper, faster.
Jensen’s Sweet Headache
Which brings us to the man in the leather jacket. Dojo’s funeral looks like Jensen Huang’s coronation—until you notice the headache. Whether you serve LLMs or World Models, the beasts devour compute, keeping Nvidia and TSMC on the throne. But the balance is shifting from training to inference, from a handful of hyperscalers to an ocean of edge devices. The way the pie is sliced is changing forever.
In the end, Musk simply saw the end of the sentence. The exquisite realm of words and symbols is still just Flatland.
Goodbye, flatland—reality is 3-D.
Adapted from my podcast: People’s Park Talks AI
👉 Subscribe to “JustSayAI Daily Brief” · twice a day · one-click listen: https://justsayai.org/newsletter (VPN required)
【Follow us】:
📺 Bilibili: JustCallMeXiaoSu
📕 Xiaohongshu: People’s Park Talks AI
▶️ YouTube: People’s Park Talks AI
