DeepSeek Launches V4 Flash + V4 Pro — Claims Coding-SOTA a Year After R1 Rattled Silicon Valley

Chinese AI lab DeepSeek has rolled out preview versions of its long-awaited V4 series — the flagship V4 Pro for coding and agentic reasoning, alongside V4 Flash for fast/cheap inference. The release lands roughly one year after the company’s R1 model rattled Silicon Valley.

What’s Shipped

V4 Pro: reasoning + agentic — the company’s pitch is top-tier performance on coding benchmarks and complex tool-use
V4 Flash: a smaller, faster sibling targeted at production-scale inference at low per-token cost
Open weights: consistent with DeepSeek’s prior releases — accelerates downstream fine-tuning by labs and enterprises

Why It Matters

Two narratives collide here. Narrative one: a year after R1’s training-cost shock, US labs (OpenAI, Anthropic, Google) have widened their compute lead but cost-curves are again under pressure. Narrative two: DeepSeek’s coding-benchmark claims position V4 Pro as a credible alternative for enterprise dev workflows — particularly where data-residency or budget constrain access to closed-frontier models.

Geopolitical Frame

The release lands amid an export-control landscape that has tightened, not loosened, since R1. Whether DeepSeek’s compute base is truly H800-only (as claimed publicly) or includes grey-market H100s remains a live debate. Either way, open-weight Chinese frontier models are now a recurring quarterly event, not a one-off.