Qwen3-Coder-30B-A3B-Instruct is an outstanding example of a high-quality large language model with advanced specialization in programming. This Mixture-of-Experts model has 30.5 billion total parameters, of which only 3.3 billion are activated per token, and out of 128 experts, only 8 are activated per token. The model comprises 48 hidden layers with grouped query attention (32 heads for Q and 4 for KV), delivering exceptional processing efficiency with minimal computational resource consumption. Native support for a 262,144-token context window—expandable up to 1 million tokens via Yarn—makes the model ideal for working with large code repositories within complex projects.
The key unique feature of Qwen3-Coder-30B-A3B-Instruct lies in its superior agent capabilities. The model does not merely generate code; it autonomously interacts with development tools, executes multi-step programming tasks, and is capable of solving complex problems without human intervention. On the LiveCodeBench v6 benchmark, the model achieves an impressive 66.0%, significantly outperforming the base version Qwen3-30B-A3B (57.4%). In AIME25 tasks (advanced mathematics for programming), it demonstrates 85.0% accuracy, surpassing Gemini-2.5-Flash-Thinking (72.0%) and confidently competing with much larger models. The model outperforms DeepSeek V3 on most coding tasks and delivers agent workflow performance comparable to Claude Sonnet 4, a remarkable achievement for an open-source solution.
Qwen3-Coder-30B-A3B-Instruct unlocks entirely new possibilities in software development. The model is integrated with popular agent-based programming platforms, including Qwen Code, CLINE, Roo Code, and Kilo Code, offering a unified function-calling format for seamless operation within CI/CD pipelines. Support for 358 programming languages makes it a universal reference tool for developers. The model particularly excels in repository-scale understanding scenarios, where it can analyze and modify massive codebases, automatically refactor legacy code, and create complex full-stack applications with minimal developer intervention.
| Model Name | Context | Type | GPU | Status | Link |
|---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
262,144.0 tensor |
2 | $0.93 | 1.000 | Launch | ||
262,144.0 tensor |
4 | $0.96 | 1.392 | Launch | ||
262,144.0 tensor |
2 | $1.23 | 1.000 | Launch | ||
262,144.0 tensor |
4 | $1.26 | 1.392 | Launch | ||
262,144.0 tensor |
2 | $1.56 | 1.000 | Launch | ||
262,144.0 tensor |
2 | $1.92 | 1.000 | Launch | ||
262,144.0 tensor |
2 | $2.22 | 1.600 | Launch | ||
262,144.0 |
1 | $2.37 | 2.304 | Launch | ||
262,144.0 tensor |
2 | $2.93 | 1.600 | Launch | ||
262,144.0 |
1 | $3.83 | 2.304 | Launch | ||
262,144.0 |
1 | $4.11 | 2.829 | Launch | ||
262,144.0 |
1 | $4.74 | 4.592 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
262,144.0 pipeline |
3 | $1.34 | 1.204 | Launch | ||
262,144.0 tensor |
4 | $1.62 | 2.000 | Launch | ||
262,144.0 pipeline |
6 | $1.65 | 1.791 | Launch | ||
262,144.0 tensor |
2 | $2.22 | 1.008 | Launch | ||
262,144.0 pipeline |
3 | $2.29 | 1.204 | Launch | ||
262,144.0 tensor |
4 | $2.34 | 2.000 | Launch | ||
262,144.0 |
1 | $2.37 | 1.712 | Launch | ||
262,144.0 pipeline |
3 | $2.83 | 1.204 | Launch | ||
262,144.0 tensor |
4 | $2.89 | 2.000 | Launch | ||
262,144.0 tensor |
2 | $2.93 | 1.008 | Launch | ||
262,144.0 tensor |
4 | $3.60 | 2.000 | Launch | ||
262,144.0 |
1 | $3.83 | 1.712 | Launch | ||
262,144.0 |
1 | $4.11 | 2.237 | Launch | ||
262,144.0 |
1 | $4.74 | 4.000 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
262,144.0 pipeline |
6 | $3.50 | 2.408 | Launch | ||
262,144.0 |
1 | $4.11 | 1.054 | Launch | ||
262,144.0 tensor |
4 | $4.35 | 2.016 | Launch | ||
262,144.0 tensor |
2 | $4.61 | 3.425 | Launch | ||
262,144.0 tensor |
8 | $4.61 | 4.000 | Launch | ||
262,144.0 |
1 | $4.74 | 2.816 | Launch | ||
262,144.0 tensor |
4 | $5.74 | 2.016 | Launch | ||
262,144.0 pipeline |
6 | $5.83 | 2.408 | Launch | ||
262,144.0 tensor |
8 | $7.51 | 4.000 | Launch | ||
262,144.0 tensor |
2 | $7.84 | 3.425 | Launch | ||
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.