minimax/minimax-01
Model Card
MiniMax-01 is a 456B parameter foundation model (45.9B activated) using a hybrid Lightning Attention + MoE architecture, achieving top-tier performance on reasoning, math, and coding benchmarks. It supports up to 4 million tokens of context, making it especially strong for long-context tasks and AI agent applications.
Context Window
N/A
tokens
Max Output
1,000,192
tokens
Input Cost
$0.2
per million tokens
Output Cost
$1.1
per million tokens
API Usage Example
Add MiniMax: MiniMax-01 to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "minimax/minimax-01"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
Get started with Puter.js
Add MiniMax: MiniMax-01 to your app without worrying about API keys or setup.
Read the Docs View Tutorials