qwen/qwen3-coder-next
Model Card
Qwen3-Coder-Next is an open-weight coding model from Alibaba's Qwen team with 80B total parameters but only 3B active per token, designed specifically for coding agents and local development with a 256K context window. It uses a sparse Mixture-of-Experts (MoE) architecture with hybrid attention, trained on 800K executable coding tasks using reinforcement learning to excel at long-horizon reasoning, tool calling, and recovering from execution failures. It achieves performance comparable to models with 10-20x more active parameters on benchmarks like SWE-Bench while maintaining low inference costs.
Context Window
N/A
tokens
Max Output
66K
tokens
Input Cost
$0.07
per million tokens
Output Cost
$0.3
per million tokens
Release Date
Feb 3, 2026
API Usage Example
Add Qwen3 Coder Next to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "qwen/qwen3-coder-next"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
Get started with Puter.js
Add Qwen3 Coder Next to your app without worrying about API keys or setup.
Read the Docs View Tutorials