Moonshot AI: Kimi Dev 72B

This model is no longer available.

Add AI to your application with Puter.js.

Explore Other Models

Model Card

Kimi Dev 72B is a 72-billion-parameter coding model by Moonshot AI, purpose-built for software engineering tasks like bug fixing, code generation, and unit test creation. It is based on the Qwen 2.5-72B architecture and fine-tuned with large-scale reinforcement learning on real-world GitHub issues and pull requests.

The model achieved 60.4% on SWE-bench Verified, setting a state-of-the-art result among open-source models at the time of its June 2025 release. It uses a two-stage framework — file localization followed by precise code editing — that mirrors how human developers approach issue resolution.

Kimi Dev 72B is a strong pick for automated code repair and test generation workflows where a specialized coding model outperforms general-purpose alternatives.

Context Window N/A

tokens

Max Output 131K

tokens

Input Cost $0.29

per million tokens

Output Cost $1.15

per million tokens

Release Date May 15, 2025

 

Code Example

Add AI to your app with the Puter.js AI API — no API keys or setup required.

// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';

puter.ai.chat("Explain quantum computing in simple terms").then(response => {
    document.body.innerHTML = response.message.content;
});
<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat("Explain quantum computing in simple terms").then(response => {
            document.body.innerHTML = response.message.content;
        });
    </script>
</body>
</html>

More AI Models From Moonshot AI

Find other Moonshot AI models

Chat

Kimi K2.6

Kimi K2.6 is Moonshot AI's latest open-weight multimodal model, built on a 1-trillion-parameter mixture-of-experts architecture with a 256K context window. It excels at agentic coding and long-horizon execution, supporting sustained autonomous workflows with 4,000+ tool calls across languages like Rust, Go, and Python. On key benchmarks, it scores 58.6 on SWE-Bench Pro, 54.0 on HLE with Tools, and 50.0 on Toolathlon — competitive with GPT-5.4 and Claude Opus 4.6 on coding and agent tasks, though trailing them on pure reasoning. The model accepts text, image, and video input, supports both thinking and non-thinking modes, and offers an OpenAI-compatible API. It's a strong pick for developers building multi-step agentic workflows and complex software engineering pipelines.

Chat

Kimi K2.5

Kimi K2.5 is Moonshot AI's most capable open-source model, a natively multimodal (vision + text) trillion-parameter MoE with 32B active parameters released in January 2026. Built through continual pretraining on ~15 trillion mixed visual and text tokens atop the K2 base, it supports both thinking and instant modes with a 256K context window. It scored 76.8% on SWE-bench Verified, 96.1% on AIME 2025, and 50.2% on Humanity's Last Exam with tools — outperforming Claude Opus 4.5 and GPT-5.2 on the latter. Its standout feature is Agent Swarm, which coordinates up to 100 parallel sub-agents for complex tasks. K2.5 excels at vision-to-code generation, frontend development from screenshots, and large-scale agentic workflows, making it a strong choice for developers building multimodal AI agents.

Chat

Kimi K2 0905

Kimi K2 0905 is Moonshot AI's September 2025 update to the original Kimi K2, delivering enhanced coding performance and improved tool-calling reliability. It shares the same 1-trillion-parameter MoE architecture with 32B active parameters but doubles the context window from 128K to 256K tokens. Key improvements include stronger frontend development capabilities — producing cleaner, more polished UI code for frameworks like React, Vue, and Angular — along with better integration across popular agent scaffolds. It scored 53.7% Pass@1 on LiveCodeBench. This version is ideal for developers who want K2's agentic strengths with improved real-world coding quality and longer context support for large codebases.

Frequently Asked Questions

How do I use Kimi Dev 72B?

You can access Kimi Dev 72B by Moonshot AI through Puter.js AI API. Include the library in your web app or Node.js project and start making calls with just a few lines of JavaScript — no backend and no configuration required. You can also use it with Python or cURL via Puter's OpenAI-compatible API.

Is Kimi Dev 72B free?

Yes, it is free if you're using it through Puter.js. With the User-Pays Model, you can add Kimi Dev 72B to your app at no cost — your users pay for their own AI usage directly, making it completely free for you as a developer.

What is the pricing for Kimi Dev 72B?
Pricing for Kimi Dev 72B is based on the number of input and output tokens used per request.
Price per 1M tokens
Input$0.29
Output$1.15
Who created Kimi Dev 72B?

Kimi Dev 72B was created by Moonshot AI and released on May 15, 2025.

What is the max output length of Kimi Dev 72B?

Kimi Dev 72B can generate up to 131K tokens in a single response.

Does it work with React / Vue / Vanilla JS / Node / etc.?

Yes — the Kimi Dev 72B API works with any JavaScript framework, Node.js, or plain HTML through Puter.js. Just include the library and start building. See the documentation for more details.

Get started with Puter.js

Add AI to your application without worrying about API keys or setup.

Explore Models View Tutorials