Qwen2-72B is the flagship model of the series, featuring 72 billion parameters. Its architecture includes 80 layers with a hidden size of 8192 and implements the Grouped Query Attention mechanism with 64 query heads and 8 shared key-value heads. Combined with Dual Chunk Attention and YARN technologies, this design ensures maximum performance in processing long contexts and efficient management of KV-cache memory.
The model was trained on a high-quality dataset of 7 trillion tokens, offering maximum data diversity. The base version of the model achieves outstanding results on key benchmarks: 84.2 on MMLU, 37.9 on GPQA, 64.6 on HumanEval, 89.5 on GSM8K, and 82.4 on BBH. The instruction-tuned version, Qwen2-72B-Instruct, scores 9.1 on MT-Bench, 48.1 on Arena-Hard, and 35.7 on LiveCodeBench, placing it among the top-tier proprietary models.
Qwen2-72B demonstrates exceptional capabilities in complex reasoning, step-by-step problem solving, advanced programming, and deep contextual understanding. Its multilingual support enables professional-level performance in more than 30 languages, including Russian. Accordingly, it is designed for the most demanding AI use cases—high-level scientific research, complex software development, creation of high-quality professional content, advanced data analysis, automation of complex business processes, and intelligent decision-making systems.
Model Name | Context | Type | GPU | TPS | Status | Link |
---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
Name | vCPU | RAM, MB | Disk, GB | GPU | |||
---|---|---|---|---|---|---|---|
16 | 65536 | 160 | 2 | $0.93 | Launch | ||
16 | 65536 | 160 | 4 | $1.48 | Launch | ||
16 | 65536 | 160 | 2 | $1.67 | Launch | ||
16 | 65536 | 160 | 2 | $2.19 | Launch | ||
16 | 65535 | 240 | 2 | $2.22 | Launch | ||
16 | 65536 | 160 | 1 | $2.58 | Launch | ||
16 | 65536 | 160 | 2 | $2.93 | Launch | ||
16 | 65536 | 160 | 1 | $5.11 | Launch |
Name | vCPU | RAM, MB | Disk, GB | GPU | |||
---|---|---|---|---|---|---|---|
16 | 131072 | 160 | 4 | $1.75 | Launch | ||
16 | 131072 | 160 | 4 | $3.23 | Launch | ||
16 | 131072 | 160 | 4 | $4.26 | Launch | ||
16 | 98304 | 160 | 3 | $4.34 | Launch | ||
24 | 262144 | 160 | 2 | $5.35 | Launch | ||
24 | 262144 | 160 | 2 | $10.40 | Launch |
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.