ML//model//open source model//DeepSeek

2026-03-06

Chinese AI lab. V1 through V3 pushed open-weight models to frontier performance at a fraction of the cost.

Chinese AI lab. V1 through V3 pushed open-weight models to frontier performance at a fraction of the cost.

V3: Mixture of Experts architecture, 671B total params, ~37B active per token.

R1 (Jan 2025): matched o1-level reasoning using reinforcement learning with GRPO: no supervised fine-tuning for the reasoning phase.

Published full training recipes, open-sourced weights. Forced pricing collapse across the industry.