DeepSeek-R1-Distill-32B is a distilled model built upon Qwen2.5-32B, incorporating the best reasoning algorithms from DeepSeek-R1 and expert knowledge. It sets new records among open-source dense models across several reasoning benchmarks: AIME 2024–72.6%, MATH-500–94.3%, and others. In practice, this model is nearly on par with the distilled 70-billion-parameter version and even surpasses it in certain tests.
Technically, the model is designed for solving expert-level tasks: complex mathematical computations, code generation and analysis, scientific research, and processing long or intricate contexts. DeepSeek-R1-Distill-32B can be integrated into enterprise systems, cloud services, and platforms for automating intellectual labor. For end-user applications, it is indispensable for building expert systems, scientific assistants, platforms for automating complex business processes, and educational solutions that require thorough and well-articulated explanations in responses.
DeepSeek-R1-Distill-32B is the choice for those seeking maximum performance among open-source models without the need to move to the heaviest systems.
| Model Name | Context | Type | GPU | Status | Link |
|---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
131,072.0 pipeline |
3 | $0.88 | 0.515 | Launch | ||
131,072.0 tensor |
4 | $1.18 | 0.325 | Launch | ||
131,072.0 tensor |
2 | $1.23 | 0.593 | Launch | ||
131,072.0 tensor |
4 | $1.29 | 0.887 | Launch | ||
131,072.0 pipeline |
3 | $1.31 | 0.515 | Launch | ||
131,072.0 tensor |
4 | $1.43 | 0.887 | Launch | ||
131,072.0 tensor |
4 | $1.75 | 1.787 | Launch | ||
131,072.0 tensor |
4 | $1.88 | 0.212 | Launch | ||
131,072.0 tensor |
2 | $1.92 | 0.593 | Launch | ||
131,072.0 |
1 | $2.37 | 1.572 | Launch | ||
131,072.0 tensor |
2 | $2.93 | 1.043 | Launch | ||
131,072.0 tensor |
4 | $3.01 | 1.787 | Launch | ||
131,072.0 |
1 | $3.83 | 1.572 | Launch | ||
131,072.0 |
1 | $4.11 | 1.965 | Launch | ||
131,072.0 tensor |
2 | $4.93 | 3.743 | Launch | ||
131,072.0 tensor |
2 | $9.40 | 7.175 | Launch | ||
131,072.0 tensor |
4 | $19.23 | 14.950 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
131,072.0 pipeline |
3 | $0.88 | 0.116 | Launch | ||
131,072.0 tensor |
4 | $1.18 | -0.074 | Launch | ||
131,072.0 tensor |
2 | $1.23 | 0.195 | Launch | ||
131,072.0 tensor |
4 | $1.29 | 0.488 | Launch | ||
131,072.0 pipeline |
3 | $1.31 | 0.116 | Launch | ||
131,072.0 tensor |
4 | $1.43 | 0.488 | Launch | ||
131,072.0 tensor |
4 | $1.75 | 1.388 | Launch | ||
131,072.0 tensor |
4 | $1.88 | -0.187 | Launch | ||
131,072.0 tensor |
2 | $1.92 | 0.195 | Launch | ||
131,072.0 |
1 | $2.37 | 1.173 | Launch | ||
131,072.0 tensor |
2 | $2.93 | 0.645 | Launch | ||
131,072.0 tensor |
4 | $3.01 | 1.388 | Launch | ||
131,072.0 |
1 | $3.83 | 1.173 | Launch | ||
131,072.0 |
1 | $4.11 | 1.566 | Launch | ||
131,072.0 tensor |
2 | $4.93 | 3.345 | Launch | ||
131,072.0 tensor |
2 | $9.40 | 6.776 | Launch | ||
131,072.0 tensor |
4 | $19.23 | 14.551 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
131,072.0 pipeline |
3 | $0.88 | -0.933 | Launch | ||
131,072.0 tensor |
4 | $1.18 | -1.123 | Launch | ||
131,072.0 tensor |
2 | $1.23 | -0.854 | Launch | ||
131,072.0 tensor |
4 | $1.29 | -0.561 | Launch | ||
131,072.0 pipeline |
3 | $1.31 | -0.933 | Launch | ||
131,072.0 tensor |
4 | $1.43 | -0.561 | Launch | ||
131,072.0 tensor |
4 | $1.75 | 0.339 | Launch | ||
131,072.0 tensor |
4 | $1.88 | -1.236 | Launch | ||
131,072.0 tensor |
2 | $1.92 | -0.854 | Launch | ||
131,072.0 |
1 | $2.37 | 0.124 | Launch | ||
131,072.0 tensor |
2 | $2.93 | -0.404 | Launch | ||
131,072.0 tensor |
4 | $3.01 | 0.339 | Launch | ||
131,072.0 |
1 | $3.83 | 0.124 | Launch | ||
131,072.0 |
1 | $4.11 | 0.517 | Launch | ||
131,072.0 tensor |
2 | $4.93 | 2.296 | Launch | ||
131,072.0 tensor |
2 | $9.40 | 5.727 | Launch | ||
131,072.0 tensor |
4 | $19.23 | 13.502 | Launch | ||
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.