Xiaomi: MiMo-V2-Flash API
Access Xiaomi: MiMo-V2-Flash using the Puter.js AI API.
Model ID: xiaomi/mimo-v2-flash
Model Card
MiMo-V2-Flash is Xiaomi's open-source Mixture-of-Experts language model with 309B total parameters (15B active), designed for high-speed reasoning, coding, and agentic workflows. It uses a hybrid attention architecture with Multi-Token Prediction to achieve up to 150 tokens/second inference while keeping costs extremely low. The model excels at software engineering benchmarks and supports a 256K context window.
Context Window
256K tokens
Max Output
N/A
Input Cost
$0.09 per million tokens
Output Cost
$0.29 per million tokens
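At these rates, the cost of a request is easy to estimate from its token counts. Below is a minimal sketch; estimateCost and the sample token counts are illustrative only and not part of the Puter.js API.

// Rough cost estimate at $0.09 per 1M input tokens and $0.29 per 1M output tokens.
// estimateCost is a hypothetical helper for illustration, not a Puter.js function.
function estimateCost(inputTokens, outputTokens) {
  const inputCost = (inputTokens / 1_000_000) * 0.09;
  const outputCost = (outputTokens / 1_000_000) * 0.29;
  return inputCost + outputCost;
}

// Example: a 2,000-token prompt with a 500-token reply
console.log(estimateCost(2000, 500).toFixed(6)); // ≈ $0.000325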
API Usage Example
Add Xiaomi: MiMo-V2-Flash to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        // Send a prompt to MiMo-V2-Flash and render the reply in the page
        puter.ai.chat("Explain quantum computing in simple terms", {
            model: "xiaomi/mimo-v2-flash"
        }).then(response => {
            document.body.innerHTML = response.message.content;
        });
    </script>
</body>
</html>
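For longer replies, you can stream the output as it is generated instead of waiting for the full message. This is a sketch that assumes puter.ai.chat accepts a stream option and yields chunks with a text field, as described in the general Puter.js AI documentation.

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        // Stream the model's reply chunk by chunk.
        // Assumption: { stream: true } returns an async iterable whose parts carry a `text` field.
        (async () => {
            const stream = await puter.ai.chat("Explain quantum computing in simple terms", {
                model: "xiaomi/mimo-v2-flash",
                stream: true
            });
            for await (const part of stream) {
                if (part?.text) {
                    document.body.innerHTML += part.text;
                }
            }
        })();
    </script>
</body>
</html>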
Get started with Puter.js
Add Xiaomi: MiMo-V2-Flash to your app without worrying about API keys or setup.
Read the Docs · View Tutorials