Qwen3-Coder-30B-A3B-Instruct is an outstanding example of a high-quality large language model with advanced specialization in programming. This Mixture-of-Experts model has 30.5 billion total parameters, of which only 3.3 billion are activated per token, and out of 128 experts, only 8 are activated per token. The model comprises 48 hidden layers with grouped query attention (32 heads for Q and 4 for KV), delivering exceptional processing efficiency with minimal computational resource consumption. Native support for a 262,144-token context window—expandable up to 1 million tokens via Yarn—makes the model ideal for working with large code repositories within complex projects.
The key unique feature of Qwen3-Coder-30B-A3B-Instruct lies in its superior agent capabilities. The model does not merely generate code; it autonomously interacts with development tools, executes multi-step programming tasks, and is capable of solving complex problems without human intervention. On the LiveCodeBench v6 benchmark, the model achieves an impressive 66.0%, significantly outperforming the base version Qwen3-30B-A3B (57.4%). In AIME25 tasks (advanced mathematics for programming), it demonstrates 85.0% accuracy, surpassing Gemini-2.5-Flash-Thinking (72.0%) and confidently competing with much larger models. The model outperforms DeepSeek V3 on most coding tasks and delivers agent workflow performance comparable to Claude Sonnet 4, a remarkable achievement for an open-source solution.
Qwen3-Coder-30B-A3B-Instruct unlocks entirely new possibilities in software development. The model is integrated with popular agent-based programming platforms, including Qwen Code, CLINE, Roo Code, and Kilo Code, offering a unified function-calling format for seamless operation within CI/CD pipelines. Support for 358 programming languages makes it a universal reference tool for developers. The model particularly excels in repository-scale understanding scenarios, where it can analyze and modify massive codebases, automatically refactor legacy code, and create complex full-stack applications with minimal developer intervention.
| Model Name | Context | Type | GPU | Status | Link |
|---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
262,144.0 tensor |
4 | $1.29 | 1.331 | Launch | ||
262,144.0 tensor |
4 | $1.43 | 1.331 | Launch | ||
262,144.0 tensor |
4 | $1.75 | 2.531 | Launch | ||
262,144.0 tensor |
4 | $2.34 | 2.531 | Launch | ||
262,144.0 |
1 | $2.37 | 2.244 | Launch | ||
262,144.0 pipeline |
3 | $2.83 | 1.735 | Launch | ||
262,144.0 tensor |
2 | $2.93 | 1.539 | Launch | ||
262,144.0 tensor |
4 | $3.01 | 2.531 | Launch | ||
262,144.0 tensor |
4 | $3.60 | 2.531 | Launch | ||
262,144.0 |
1 | $3.83 | 2.244 | Launch | ||
262,144.0 |
1 | $4.11 | 2.769 | Launch | ||
262,144.0 tensor |
2 | $4.93 | 5.139 | Launch | ||
262,144.0 tensor |
2 | $9.40 | 9.714 | Launch | ||
262,144.0 tensor |
4 | $19.23 | 20.081 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
262,144.0 pipeline |
6 | $1.69 | 1.765 | Launch | ||
262,144.0 tensor |
4 | $1.75 | 1.974 | Launch | ||
262,144.0 tensor |
4 | $2.34 | 1.974 | Launch | ||
262,144.0 |
1 | $2.37 | 1.686 | Launch | ||
262,144.0 pipeline |
3 | $2.83 | 1.178 | Launch | ||
262,144.0 tensor |
4 | $3.01 | 1.974 | Launch | ||
262,144.0 tensor |
4 | $3.60 | 1.974 | Launch | ||
262,144.0 |
1 | $3.83 | 1.686 | Launch | ||
262,144.0 |
1 | $4.11 | 2.211 | Launch | ||
262,144.0 pipeline |
3 | $4.34 | 2.078 | Launch | ||
262,144.0 tensor |
2 | $4.93 | 4.582 | Launch | ||
262,144.0 tensor |
4 | $5.84 | 3.174 | Launch | ||
262,144.0 tensor |
2 | $9.40 | 9.157 | Launch | ||
262,144.0 tensor |
4 | $19.23 | 19.524 | Launch | ||
| Name | GPU | TPS | Max Concurrency | |||
|---|---|---|---|---|---|---|
262,144.0 pipeline |
6 | $3.63 | 2.405 | Launch | ||
262,144.0 |
1 | $4.11 | 1.051 | Launch | ||
262,144.0 tensor |
8 | $4.61 | 3.997 | Launch | ||
262,144.0 tensor |
2 | $4.93 | 3.422 | Launch | ||
262,144.0 tensor |
2 | $4.93 | 3.422 | Launch | ||
262,144.0 pipeline |
6 | $5.83 | 2.405 | Launch | ||
262,144.0 tensor |
4 | $5.84 | 2.014 | Launch | ||
262,144.0 tensor |
8 | $7.51 | 3.997 | Launch | ||
262,144.0 tensor |
2 | $7.84 | 3.422 | Launch | ||
262,144.0 tensor |
2 | $9.40 | 7.997 | Launch | ||
262,144.0 tensor |
4 | $19.23 | 18.364 | Launch | ||
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.