xAI: Grok 4.1 Fast Non-Reasoning API
Access xAI: Grok 4.1 Fast Non-Reasoning from xAI using Puter.js AI API.
Get Startedx-ai/grok-4-1-fast-non-reasoning
Model Card
Grok 4.1 Fast Non-Reasoning is the low-latency, non-reasoning variant of Grok 4.1 Fast that skips extended chain-of-thought for speed-critical applications. It shares the same model weights and 2M context window as Grok 4.1 Fast but delivers instant responses without deliberation overhead, ideal for real-time customer support and streaming interactions.
Context Window
N/A
tokens
Max Output
2M
tokens
Input Cost
$0.2
per million tokens
Output Cost
$0.5
per million tokens
Release Date
Nov 19, 2025
API Usage Example
Add xAI: Grok 4.1 Fast Non-Reasoning to your app with just a few lines of code.
No API keys, no backend, no configuration required.
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "x-ai/grok-4-1-fast-non-reasoning"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
Get started with Puter.js
Add xAI: Grok 4.1 Fast Non-Reasoning to your app without worrying about API keys or setup.
Read the Docs View Tutorials