Four Chinese AI Models in 12 Days. Export Controls Built a Speed Competition, Not a Capability Gap.
The US government has spent two years arguing that restricting access to advanced semiconductors would create a durable capability gap between American and Chinese AI labs. In a 12-day window earlier this month, four Chinese open-weights models — GLM-5.1 from Z.ai, MiniMax M2.7, Kimi K2.6 from Moonshot AI, and DeepSeek V4 — hit the market at frontier-competitive performance levels and under a third of the price of equivalent US API access.
Kimi K2.6 is the sharpest data point. It became the first open-weights model to beat GPT-5.4 on SWE-Bench Pro — the benchmark used to measure real agentic engineering capability. That's not a closed Chinese model catching up. That's a freely downloadable model surpassing a closed frontier product on the task that matters most for software-building agents.
The pricing gap is structural. Running one million tokens through Claude Opus costs approximately $15 via Anthropic's API. The equivalent through Kimi K2.6 is approximately $4.50. On DeepSeek V4, lower still. For any enterprise making high-volume inference decisions, that's not a preference question — it's a P&L question.
The coordinated timing of four releases in 12 days is itself a signal. This wasn't coincidence. It reflects a Chinese AI ecosystem that has internalized the export control constraint and responded with parallelized development velocity rather than capability retreat. The chip controls were designed to impose a 2–3 year lag. The labs closed it in months by optimizing for architecture efficiency and distributed training on older hardware generations.
For the application layer, this changes the investment calculus in one specific way: model selection is becoming a cost arbitrage decision, not a capability one. The intelligence commodity price is now being set by open-weights competition. Any US startup that built a margin assumption on API pricing differentials needs to revisit the unit economics.
What remains defensible? The same things it always was. Proprietary data that doesn't exist in any training set. Domain expertise that requires years of enterprise context to replicate. Distribution advantages that make switching costs real. The intelligence layer is converging to a commodity. The moat is everywhere the model can't go alone.