Moonshot AI API
Access Moonshot AI models instantly with Puter.js and add AI to any app in a few lines of code, with no backend or API keys required.
// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';
puter.ai.chat("Explain AI like I'm five!", {
  model: "moonshotai/kimi-k2"
}).then(response => {
  console.log(response);
});
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
  puter.ai.chat("Explain AI like I'm five!", {
    model: "moonshotai/kimi-k2"
  }).then(response => {
    console.log(response);
  });
</script>
</body>
</html>
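The examples above only log a successful response. As a minimal sketch of the same call with async/await and basic error handling, the helper below takes the chat function as a parameter so it can be exercised without a network connection. The name `askKimi` and the `{ ok, response, error }` result shape are assumptions made for illustration and are not part of Puter.js.

```javascript
// Hypothetical helper (not part of Puter.js): calls a chat function and
// returns failures as a value instead of an unhandled promise rejection.
// `chat` is expected to be callable like puter.ai.chat(prompt, options).
async function askKimi(chat, prompt, model = "moonshotai/kimi-k2") {
  try {
    const response = await chat(prompt, { model });
    return { ok: true, response };
  } catch (error) {
    return { ok: false, error };
  }
}

// In a page or module where Puter.js is loaded, usage might look like:
// askKimi((p, o) => puter.ai.chat(p, o), "Explain AI like I'm five!")
//   .then(result => console.log(result.ok ? result.response : result.error));
```

Passing the chat function in, rather than referencing `puter` directly, keeps the helper usable from both the npm import and the script-tag setup.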
List of Moonshot AI Models
Kimi K2.5
moonshotai/kimi-k2.5
Kimi K2.5 is Moonshot AI's most capable open-source model, a natively multimodal (vision + text) trillion-parameter MoE with 32B active parameters released in January 2026. Built through continual pretraining on ~15 trillion mixed visual and text tokens atop the K2 base, it supports both thinking and instant modes with a 256K context window. It scored 76.8% on SWE-bench Verified, 96.1% on AIME 2025, and 50.2% on Humanity's Last Exam with tools — outperforming Claude Opus 4.5 and GPT-5.2 on the latter. Its standout feature is Agent Swarm, which coordinates up to 100 parallel sub-agents for complex tasks. K2.5 excels at vision-to-code generation, frontend development from screenshots, and large-scale agentic workflows, making it a strong choice for developers building multimodal AI agents.
Kimi K2 0905
moonshotai/kimi-k2-0905
Kimi K2 0905 is Moonshot AI's September 2025 update to the original Kimi K2, delivering enhanced coding performance and improved tool-calling reliability. It shares the same 1-trillion-parameter MoE architecture with 32B active parameters but doubles the context window from 128K to 256K tokens. Key improvements include stronger frontend development capabilities — producing cleaner, more polished UI code for frameworks like React, Vue, and Angular — along with better integration across popular agent scaffolds. It scored 53.7% Pass@1 on LiveCodeBench. This version is ideal for developers who want K2's agentic strengths with improved real-world coding quality and longer context support for large codebases.
Kimi K2 Thinking
moonshotai/kimi-k2-thinking
Kimi K2 Thinking is Moonshot AI's reasoning-enhanced variant of Kimi K2, trained to interleave step-by-step chain-of-thought with dynamic tool calls. It supports up to 200–300 sequential tool calls without drift, enabling deep autonomous research, coding, and analysis workflows. It achieves 71.3% on SWE-bench Verified, 44.9% on Humanity's Last Exam (with tools), 60.2% on BrowseComp, and 99.1% on AIME 2025 (with Python) — placing it among the top open-source thinking models. It uses native INT4 quantization and a 256K context window. K2 Thinking is designed for complex, multi-step tasks where extended reasoning and sustained tool orchestration matter more than low-latency responses.
Kimi K2 0711
moonshotai/kimi-k2
Kimi K2 is a trillion-parameter Mixture-of-Experts model by Moonshot AI, activating 32 billion parameters per token. Designed as a non-thinking model optimized for agentic capabilities, it excels at tool use, code generation, and autonomous problem-solving with a 128K token context window. On benchmarks, K2 scored 65.8% on SWE-bench Verified, 75.1% on GPQA-Diamond, 49.5% on AIME 2025, and 66.1 on Tau2-bench — surpassing most open- and closed-source models in non-thinking settings. It ranked as the #1 open-source model on the LMSYS Arena leaderboard upon release in July 2025. K2 is well suited for developers building AI agents and tool-calling pipelines who need strong coding and reasoning without extended thinking overhead.
Kimi Dev 72B
moonshotai/kimi-dev-72b
Kimi Dev 72B is a 72-billion-parameter coding model by Moonshot AI, purpose-built for software engineering tasks like bug fixing, code generation, and unit test creation. It is based on the Qwen 2.5-72B architecture and fine-tuned with large-scale reinforcement learning on real-world GitHub issues and pull requests. The model achieved 60.4% on SWE-bench Verified, setting a state-of-the-art result among open-source models at the time of its June 2025 release. It uses a two-stage framework — file localization followed by precise code editing — that mirrors how human developers approach issue resolution. Kimi Dev 72B is a strong pick for automated code repair and test generation workflows where a specialized coding model outperforms general-purpose alternatives.
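As a rough illustration of how the descriptions above might translate into a model choice, the sketch below maps a few task labels to the model IDs listed in this section. The task labels and the `modelForTask` helper are hypothetical and not part of Puter.js; only the model IDs come from the list, and the pairing reflects one reading of each model's stated strengths.

```javascript
// Hypothetical task labels mapped to the model IDs listed in this section.
const MODEL_BY_TASK = new Map([
  ["vision", "moonshotai/kimi-k2.5"],             // multimodal, vision-to-code
  ["long-codebase", "moonshotai/kimi-k2-0905"],   // 256K context, coding focus
  ["deep-reasoning", "moonshotai/kimi-k2-thinking"], // extended reasoning with tools
  ["agents", "moonshotai/kimi-k2"],               // non-thinking, tool calling
  ["code-repair", "moonshotai/kimi-dev-72b"],     // bug fixing and test generation
]);

// Returns the model ID for a task label, or the fallback for unknown labels.
function modelForTask(task, fallback = "moonshotai/kimi-k2") {
  return MODEL_BY_TASK.get(task) ?? fallback;
}
```

The returned string can then be passed as the `model` option of `puter.ai.chat`, as in the examples at the top of the page.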
Frequently Asked Questions
The Moonshot AI API gives you access to models for AI chat. Through Puter.js, you can start using Moonshot AI models instantly with zero setup or configuration.
Puter.js supports a variety of Moonshot AI models, including Kimi K2.5, Kimi K2 0905, Kimi K2 Thinking, and more. Find all AI models supported by Puter.js in the AI model list.
With the User-Pays model, users cover their own AI costs through their Puter account. This means you can build apps without worrying about infrastructure expenses.
Puter.js is a JavaScript library that provides access to AI, storage, and other cloud services directly from a single API. It handles authentication, infrastructure, and scaling so you can focus on building your app.
The Moonshot AI API through Puter.js works with any JavaScript framework, Node.js, or plain HTML. Include the library and start building. See the documentation for more details.