z-ai/glm-5
Model Card
GLM-5 is the fifth-generation flagship open-weight foundation model from Zhipu AI (Z.ai). It uses a Mixture-of-Experts architecture with 744B total parameters, of which 40B are active, and is designed for agentic engineering, complex systems coding, and long-horizon agent tasks. It achieves state-of-the-art performance among open-weight models on coding and agentic benchmarks such as SWE-bench Verified and Terminal Bench 2.0, approaching Claude Opus 4.5-level capability.
Context Window: N/A
Max Output: 203K tokens
Input Cost: $1 per million tokens
Output Cost: $3.2 per million tokens
Release Date: Feb 12, 2026
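As a rough illustration of how the listed prices translate into a per-request cost, the sketch below multiplies token counts by the per-million rates above. The function name `estimateCostUSD` and the constant names are hypothetical, and the calculation assumes simple linear per-token billing with no caching discounts, tiers, or minimum charges, which the listing does not confirm.

```javascript
// Prices taken from the listing above (USD per one million tokens).
const INPUT_USD_PER_MILLION = 1;
const OUTPUT_USD_PER_MILLION = 3.2;

// Hypothetical helper: estimates the cost of one request in USD,
// assuming billing is strictly linear in the number of tokens.
function estimateCostUSD(inputTokens, outputTokens) {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("Token counts must be non-negative");
  }
  const inputCost = (inputTokens * INPUT_USD_PER_MILLION) / 1e6;
  const outputCost = (outputTokens * OUTPUT_USD_PER_MILLION) / 1e6;
  return inputCost + outputCost;
}

// Example: a 50,000-token prompt with a 10,000-token reply
// comes to about $0.05 + $0.032 = $0.082 under these assumptions.
console.log(estimateCostUSD(50000, 10000).toFixed(3));
```

Because output tokens cost more than three times as much as input tokens here, long generations tend to dominate the bill even when prompts are large.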
API Usage Example
Add Z.AI: GLM 5 to your app with just a few lines of code.
No API keys, no backend, no configuration required.
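<html>
<body>
  <script src="https://js.puter.com/v2/"></script>
  <script>
    // Send a prompt to GLM-5 through Puter.js and show the reply on the page.
    puter.ai.chat("Explain quantum computing in simple terms", {
      model: "z-ai/glm-5"
    }).then(response => {
      // textContent keeps the model output from being parsed as HTML.
      document.body.textContent = response.message.content;
    }).catch(error => {
      console.error("GLM-5 request failed:", error);
    });
  </script>
</body>
</html>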
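If the same call is needed in several places, it may help to wrap it in a small helper. The following is a minimal sketch, not part of Puter.js: `extractText` and `askGlm5` are hypothetical names, and the response shape (`response.message.content`) is assumed from the example above. The chat function is passed in as a parameter so the helper can be exercised without a browser.

```javascript
// Hypothetical helper: pulls the reply text out of a chat response.
// Assumes the shape used in the example above (response.message.content)
// and falls back to String(response) for anything else.
function extractText(response) {
  const content = response && response.message && response.message.content;
  return typeof content === "string" ? content : String(response);
}

// Hypothetical wrapper: `chatFn` is expected to behave like
// puter.ai.chat(prompt, options) and return a promise.
async function askGlm5(chatFn, prompt) {
  try {
    const response = await chatFn(prompt, { model: "z-ai/glm-5" });
    return extractText(response);
  } catch (err) {
    // The rejected value may not be an Error instance, so handle both cases.
    const reason = err && err.message ? err.message : String(err);
    return "Request failed: " + reason;
  }
}

// In a page that has loaded Puter.js, usage might look like:
// askGlm5((p, o) => puter.ai.chat(p, o), "Explain quantum computing in simple terms")
//   .then(text => { document.body.textContent = text; });
```

Returning a string in the failure case keeps the calling code simple, at the cost of making errors indistinguishable from replies; an application that needs to react to failures differently would likely rethrow instead.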
Get started with Puter.js
Add Z.AI: GLM 5 to your app without worrying about API keys or setup.