ML//model//open source model//DeepSeek

Chinese AI lab. V1 through V3 pushed open-weight models to frontier performance at a fraction of the cost.


Chinese AI lab. V1 through V3 pushed open-weight models to frontier performance at a fraction of the cost.

V3: Mixture of Experts architecture — 671B total params, ~37B active per token.

R1 (Jan 2025): matched o1-level reasoning using reinforcement learning with GRPO — no supervised fine-tuning for the reasoning phase.

Published full training recipes, open-sourced weights. Forced pricing collapse across the industry.