DeepSeek-R1-0528 is the first major update to the popular DeepSeek R1 series, released on May 28, 2025. The developers revised their approach to depth of thought, and the number of parameters increased to 685 billion, resulting in an improvement of more than 10 percentage points across nearly all significant benchmarks compared to the version released on January 22, 2025.
DeepSeek-R1-0528-Qwen3-8B is a compact model based on Qwen3 with 8 billion parameters, distilled from the flagship version DeepSeek-R1-0528. It achieves state-of-the-art (SOTA) results among open-source models in its category. The model is ideally suited for deployment in resource-constrained environments while retaining advanced mathematical and logical reasoning capabilities from the teacher model.
DeepSeek-R1-Distill-32B — a model built based on distilling a large MoE reasoning expert-level model, setting new records among open-source dense models. It is suitable for scientific, corporate, and educational platforms with high demands on logic and analysis.
DeepSeek-R1 is a unique reasoning model with 671 billion parameters, trained based on reinforcement learning (RL), supporting long chains of thought (CoT), and specializing in multi-step reasoning and logical analysis. It is indispensable for tasks requiring well-founded conclusions and transparent reasoning processes.
DeepSeek-R1-Distill-1.5B — a compact model that, thanks to distillation, possesses strong reasoning capabilities. It is ideal for fast text analysis in mobile and edge applications.