DeepSeek dropped their V4 model this week. It matches GPT-4 and Claude on benchmarks. It processes a million tokens using 27% of the compute their previous model needed. And it's open-source.
That last bit is what makes this land differently.
The compute story nobody's telling
DeepSeek V4 was built on Chinese chips - hardware that exists precisely because export restrictions cut off access to high-end GPUs. And DeepSeek didn't treat that constraint as a workaround. They treated it as a first-class architecture choice.
Most frontier models assume you have access to the best hardware money can buy. DeepSeek started from the opposite constraint: what if you don't?
The answer is a model that uses roughly a quarter of the compute for the same capability. That's not an incremental improvement. That's a different approach to the problem.
For developers, this changes the maths. Running inference on V4 costs less. Fine-tuning costs less. The barrier to experimenting with frontier-level models just dropped.
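Here's a rough sketch of what that ratio does to an inference bill, assuming cost scales linearly with compute. The only number taken from the release is the 27% figure; the per-token price and the monthly workload are made-up placeholders, so read the output as shape, not forecast.

```python
# Back-of-envelope inference cost comparison. Only the 27% compute ratio
# comes from the V4 release; the price and workload below are placeholders,
# and we assume cost scales linearly with compute.

PREV_COST_PER_M_TOKENS = 1.00  # hypothetical: $1.00 per million tokens on the old model
COMPUTE_RATIO = 0.27           # V4's reported compute relative to its predecessor
MONTHLY_TOKENS_M = 500         # hypothetical workload: 500M tokens a month

v4_cost_per_m_tokens = PREV_COST_PER_M_TOKENS * COMPUTE_RATIO

prev_bill = PREV_COST_PER_M_TOKENS * MONTHLY_TOKENS_M
v4_bill = v4_cost_per_m_tokens * MONTHLY_TOKENS_M

print(f"Previous model: ${prev_bill:,.2f}/month")
print(f"V4 at 27%:      ${v4_bill:,.2f}/month")
print(f"Saving:         ${prev_bill - v4_bill:,.2f}/month ({1 - COMPUTE_RATIO:.0%})")
```

Same workload, same capability, a bill cut by nearly three quarters. That's the kind of delta that moves experiments from "needs budget approval" to "try it this afternoon".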
Open-source at frontier scale
Open-source models have always trailed the closed ones. Llama catches up to GPT-3.5. Mistral gets close to GPT-4. But they're always a generation behind.
V4 is matching the current frontier. Not six months ago's frontier. Today's.
That matters because it proves the gap isn't fundamental. It's not that open models can't reach frontier performance. It's that they haven't had the resources.
DeepSeek just showed what's possible when a well-funded team commits to open release at the same time they're pushing capability. The weights are public. The architecture is documented. Anyone can run it.
Which means anyone can build on it.
Independent infrastructure, proven
The third piece is geopolitical, but it has technical implications everywhere.
For the last three years, the assumption has been: if you want frontier AI, you need access to NVIDIA's best chips. Export controls were designed around that assumption. Cut off the chips, the thinking went, and you slow down the capability.
DeepSeek V4 is proof that assumption no longer holds. You can build frontier models on different hardware. You can optimise around constraints. You can close the gap with architecture, not just compute.
That's not just a win for China. It's a signal to every country and company watching the AI race. The path to capability is wider than it looked six months ago.
What this means for builders
If you're building on top of models, your options just expanded. V4 is open-weight, so you can fine-tune it, run it locally, or host it yourself. The compute efficiency means lower costs at inference time.
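If the weights follow DeepSeek's usual release pattern, running it locally looks something like the sketch below. The repo id is a guess, not a confirmed name, and a model at this scale will want multiple GPUs or a quantised build; treat this as the shape of the workflow, not a verified recipe.

```python
# A minimal local-inference sketch using Hugging Face transformers.
# The repo id is hypothetical - check DeepSeek's actual release for the
# real model name, licence, and recommended settings.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "deepseek-ai/DeepSeek-V4"  # hypothetical repo id

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype="auto",      # use the checkpoint's native precision
    device_map="auto",       # spread layers across available GPUs/CPU
    trust_remote_code=True,  # earlier DeepSeek releases shipped custom modelling code
)

inputs = tokenizer(
    "Explain mixture-of-experts routing in one paragraph.",
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

None of that requires an API key or anyone's permission, which is the whole point of open weights.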
If you're a researcher, you have a new baseline. Open access to a model this capable changes what's possible in academic labs and smaller institutions.
And if you're a business watching AI costs, this is the proof that efficiency improvements are real. The cost curve is bending, and it's bending faster than most people expected.
DeepSeek V4 isn't just another model release. It's evidence that the rules of the game are shifting. Compute efficiency, open access, and independent infrastructure - all three at once.
The models that matter next year might not come from the names we expect. And they might not need the hardware we assumed was essential.