qwen/qwen3-max-thinking
Model Card
Qwen3 Max Thinking is Alibaba Cloud's flagship proprietary reasoning model with a 256K context window, featuring test-time scaling and adaptive tool-use capabilities (web search, code interpreter, memory) that allow it to reason iteratively and autonomously. It scores competitively against GPT-5.2 and Gemini 3 Pro on benchmarks like Humanity's Last Exam and HMMT, excelling in math, complex reasoning, and instruction following.
Context Window
N/A
tokens
Max Output
66K
tokens
Input Cost
$1.2
per million tokens
Output Cost
$6
per million tokens
Release Date
Feb 10, 2026
API Usage Example
Add Qwen3 Max Thinking to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "qwen/qwen3-max-thinking"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
Get started with Puter.js
Add Qwen3 Max Thinking to your app without worrying about API keys or setup.
Read the Docs View Tutorials