DeepSeek-R1-Distill-32B is a distilled model built upon Qwen2.5-32B, incorporating the best reasoning algorithms from DeepSeek-R1 and expert knowledge. It sets new records among open-source dense models across several reasoning benchmarks: AIME 2024–72.6%, MATH-500–94.3%, and others. In practice, this model is nearly on par with the distilled 70-billion-parameter version and even surpasses it in certain tests.
Technically, the model is designed for solving expert-level tasks: complex mathematical computations, code generation and analysis, scientific research, and processing long or intricate contexts. DeepSeek-R1-Distill-32B can be integrated into enterprise systems, cloud services, and platforms for automating intellectual labor. For end-user applications, it is indispensable for building expert systems, scientific assistants, platforms for automating complex business processes, and educational solutions that require thorough and well-articulated explanations in responses.
DeepSeek-R1-Distill-32B is the choice for those seeking maximum performance among open-source models without the need to move to the heaviest systems.
| Model Name | Context | Type | GPU | Status | Link |
|---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
131,072.0 pipeline |
3 | $1.34 | 1.130 | Launch | ||
131,072.0 tensor |
4 | $1.62 | 1.716 | Launch | ||
131,072.0 pipeline |
6 | $1.65 | 1.473 | Launch | ||
131,072.0 pipeline |
3 | $2.29 | 1.221 | Launch | ||
131,072.0 tensor |
4 | $2.34 | 1.716 | Launch | ||
131,072.0 |
1 | $2.37 | 1.641 | Launch | ||
131,072.0 pipeline |
3 | $2.83 | 1.218 | Launch | ||
131,072.0 tensor |
4 | $2.89 | 1.838 | Launch | ||
131,072.0 tensor |
2 | $2.93 | 1.087 | Launch | ||
131,072.0 tensor |
4 | $3.60 | 1.833 | Launch | ||
131,072.0 |
1 | $3.83 | 1.639 | Launch | ||
131,072.0 |
1 | $4.11 | 2.039 | Launch | ||
131,072.0 tensor |
2 | $4.61 | 3.841 | Launch | ||
131,072.0 |
1 | $4.74 | 3.382 | Launch | ||
131,072.0 tensor |
2 | $9.40 | 7.323 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
131,072.0 tensor |
4 | $1.62 | 1.276 | Launch | ||
131,072.0 pipeline |
6 | $1.65 | 1.019 | Launch | ||
131,072.0 tensor |
4 | $2.34 | 1.276 | Launch | ||
131,072.0 |
1 | $2.37 | 1.201 | Launch | ||
131,072.0 tensor |
4 | $2.89 | 1.398 | Launch | ||
131,072.0 tensor |
4 | $3.60 | 1.393 | Launch | ||
131,072.0 |
1 | $3.83 | 1.199 | Launch | ||
131,072.0 |
1 | $4.11 | 1.599 | Launch | ||
131,072.0 pipeline |
3 | $4.34 | 1.439 | Launch | ||
131,072.0 tensor |
2 | $4.61 | 3.401 | Launch | ||
131,072.0 |
1 | $4.74 | 2.942 | Launch | ||
131,072.0 tensor |
4 | $5.74 | 2.294 | Launch | ||
131,072.0 tensor |
2 | $9.40 | 6.883 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
131,072.0 tensor |
8 | 2.887 | Launch | |||
131,072.0 |
1 | $4.74 | 2.034 | Launch | ||
131,072.0 tensor |
2 | $4.93 | 2.494 | Launch | ||
131,072.0 tensor |
2 | $4.94 | 2.494 | Launch | ||
131,072.0 tensor |
4 | $5.76 | 1.386 | Launch | ||
131,072.0 pipeline |
6 | $5.84 | 1.622 | Launch | ||
131,072.0 tensor |
8 | $7.52 | 2.877 | Launch | ||
131,072.0 tensor |
2 | $7.85 | 2.489 | Launch | ||
131,072.0 tensor |
2 | $8.17 | 3.289 | Launch | ||
131,072.0 tensor |
2 | $9.41 | 5.975 | Launch | ||
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.