x-ai/grok-3-mini-fast
Model Card
Grok 3 Mini Fast is the speed-optimized variant of Grok 3 Mini. It runs on faster infrastructure for significantly quicker response times and provides the same reasoning quality as Grok 3 Mini, which makes it a fit for latency-sensitive applications.
Context Window: 131K tokens
Max Output: 131,072 tokens
Input Cost: $0.60 per million tokens
Output Cost: $4.00 per million tokens
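To make the pricing concrete, here is a minimal sketch of a per-request cost estimate. It assumes cost scales linearly with token counts at the listed rates; `estimateCostUsd` is a hypothetical helper written for illustration, not part of Puter.js or the xAI API, and actual billing may differ.

```javascript
// Listed prices for x-ai/grok-3-mini-fast, in US dollars per million tokens.
const INPUT_USD_PER_MILLION = 0.6;
const OUTPUT_USD_PER_MILLION = 4;

// Hypothetical helper: estimates the cost of one request,
// assuming cost is linear in the number of tokens.
function estimateCostUsd(inputTokens, outputTokens) {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: a 2,000-token prompt with a 500-token reply.
console.log(estimateCostUsd(2000, 500).toFixed(4)); // prints "0.0032"
```

At these rates output tokens cost several times more than input tokens, so capping reply length has a larger effect on cost than trimming the prompt.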
API Usage Example
Add Grok 3 Mini Fast to your app with just a few lines of code.
No API keys, no backend, no configuration required.
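<html>
<body>
  <script src="https://js.puter.com/v2/"></script>
  <script>
    // Send one prompt to Grok 3 Mini Fast and show the reply on the page.
    puter.ai.chat("Explain quantum computing in simple terms", {
      model: "x-ai/grok-3-mini-fast"
    }).then(response => {
      // textContent renders the reply as plain text, so markup in it is not interpreted.
      document.body.textContent = response.message.content;
    }).catch(error => {
      // Report failures instead of leaving the page blank.
      console.error("Chat request failed:", error);
    });
  </script>
</body>
</html>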
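For latency-sensitive use, showing the reply as it arrives can matter as much as total response time. The sketch below assumes that `puter.ai.chat` accepts a `stream: true` option and then resolves to an async iterable whose parts carry a `text` field; check the Puter.js documentation before relying on that shape. `streamReply` is a hypothetical helper, and the chat function is passed in as a parameter so the helper does not depend on a global.

```javascript
// Hypothetical helper: streams a reply from Grok 3 Mini Fast.
// `chat` is expected to behave like puter.ai.chat (assumption: with
// stream: true it resolves to an async iterable of { text } parts).
async function streamReply(chat, prompt, onText) {
  const parts = await chat(prompt, {
    model: "x-ai/grok-3-mini-fast",
    stream: true
  });

  let fullText = "";
  for await (const part of parts) {
    const text = part?.text ?? "";
    if (!text) continue; // skip parts that carry no text
    fullText += text;
    onText(text); // hand each chunk to the caller as it arrives
  }
  return fullText;
}

// Usage in a page that has loaded https://js.puter.com/v2/:
// streamReply(
//   (prompt, options) => puter.ai.chat(prompt, options),
//   "Explain quantum computing in simple terms",
//   text => document.body.append(text)
// ).catch(error => console.error("Chat request failed:", error));
```

Appending each chunk with `document.body.append` inserts it as a text node, so the reply is displayed as plain text.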