DeepSeek-R1 is the first generation of reasoning models developed by DeepSeek-AI and released on January 20, 2025. The model is built upon large-scale reinforcement learning (RL) training and demonstrates outstanding capabilities in solving complex tasks such as mathematics, programming, and scientific reasoning.
DeepSeek-R1 supports long chain-of-thought (CoT) generation, including self-checking, reflection, and alternative approaches to problem-solving. It achieves performance comparable to OpenAI-o1-1217 on benchmarks such as AIME 2024 (79.8%) and MATH-500 (97.3%).
The base version of DeepSeek-R1 contains 671 billion parameters and is highly resource-intensive. However, compact versions of the model are also available (1.5B, 7B, 8B, 14B, 32B, 70B), along with distilled versions derived from DeepSeek-R1 based on Qwen and Llama. As a result, DeepSeek-R1 sets a new standard in the field of reasoning models by combining the power of large-scale RL training with practical applicability, making it one of the best among open-source models.
| Model Name | Context | Type | GPU | Status | Link |
|---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
163,840.0 |
1 | $0.39 | -30.936 | Launch | ||
163,840.0 |
1 | $0.42 | -30.936 | Launch | ||
163,840.0 |
1 | $1.18 | -30.269 | Launch | ||
163,840.0 tensor |
2 | $1.23 | -14.250 | Launch | ||
163,840.0 |
1 | $1.69 | -29.602 | Launch | ||
163,840.0 tensor |
4 | $1.75 | -6.240 | Launch | ||
163,840.0 |
1 | $2.37 | -25.601 | Launch | ||
163,840.0 tensor |
4 | $3.01 | -6.240 | Launch | ||
163,840.0 |
1 | $3.83 | -25.601 | Launch | ||
163,840.0 |
1 | $4.11 | -24.434 | Launch | ||
163,840.0 tensor |
2 | $4.93 | -9.582 | Launch | ||
163,840.0 tensor |
2 | $9.40 | -4.497 | Launch | ||
163,840.0 tensor |
4 | $19.23 | 3.512 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
163,840.0 |
1 | $0.39 | -58.293 | Launch | ||
163,840.0 |
1 | $0.42 | -58.293 | Launch | ||
163,840.0 |
1 | $1.18 | -57.626 | Launch | ||
163,840.0 tensor |
2 | $1.23 | -27.929 | Launch | ||
163,840.0 |
1 | $1.69 | -56.959 | Launch | ||
163,840.0 tensor |
4 | $1.75 | -13.080 | Launch | ||
163,840.0 |
1 | $2.37 | -52.958 | Launch | ||
163,840.0 tensor |
4 | $3.01 | -13.080 | Launch | ||
163,840.0 |
1 | $3.83 | -52.958 | Launch | ||
163,840.0 |
1 | $4.11 | -51.791 | Launch | ||
163,840.0 tensor |
2 | $4.93 | -23.261 | Launch | ||
163,840.0 tensor |
2 | $9.40 | -18.176 | Launch | ||
163,840.0 tensor |
4 | $19.23 | -3.327 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
163,840.0 |
1 | $0.39 | -116.982 | Launch | ||
163,840.0 |
1 | $0.42 | -116.982 | Launch | ||
163,840.0 |
1 | $1.18 | -116.315 | Launch | ||
163,840.0 tensor |
2 | $1.23 | -57.273 | Launch | ||
163,840.0 |
1 | $1.69 | -115.648 | Launch | ||
163,840.0 tensor |
4 | $1.75 | -27.752 | Launch | ||
163,840.0 |
1 | $2.37 | -111.647 | Launch | ||
163,840.0 tensor |
4 | $3.01 | -27.752 | Launch | ||
163,840.0 |
1 | $3.83 | -111.647 | Launch | ||
163,840.0 |
1 | $4.11 | -110.480 | Launch | ||
163,840.0 tensor |
2 | $4.93 | -52.605 | Launch | ||
163,840.0 tensor |
2 | $9.40 | -47.520 | Launch | ||
163,840.0 tensor |
4 | $19.23 | -17.999 | Launch | ||
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.