xAI: Grok 4 Fast Non-Reasoning API

Access xAI's Grok 4 Fast Non-Reasoning model through the Puter.js AI API.

Model Card

Grok 4 Fast Non-Reasoning is the speed-optimized variant of Grok 4 Fast. It shares the same unified architecture and 2M-token context window, but skips the extended chain-of-thought step and answers directly. This yields throughput of up to 342 tokens/second, which suits high-volume, latency-sensitive applications.

Context Window: 2M tokens

Max Output: N/A

Input Cost: $0.20 per million tokens

Output Cost: $0.50 per million tokens

Release Date: Jul 11, 2025
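
To make the listed input and output prices concrete, here is a minimal sketch of a per-request cost estimate. It assumes the prices above apply directly to your usage. `estimateCostUsd` is a hypothetical helper written for illustration and is not part of Puter.js.

```javascript
// Prices copied from the model card, in USD per one million tokens.
const INPUT_USD_PER_MILLION = 0.2;
const OUTPUT_USD_PER_MILLION = 0.5;

// Hypothetical helper (not a Puter.js API): estimates the cost of one request
// from its input and output token counts.
function estimateCostUsd(inputTokens, outputTokens) {
    const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
    const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
    return inputCost + outputCost;
}

// Example: a 10,000-token prompt with a 2,000-token reply.
console.log(estimateCostUsd(10000, 2000).toFixed(4)); // prints "0.0030"
```

At these rates, output tokens cost 2.5 times as much as input tokens, so long replies tend to dominate the bill more than long prompts do.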

API Usage Example

Add xAI: Grok 4 Fast Non-Reasoning to your app with just a few lines of code.
No API keys, no backend, no configuration required.

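<html>
<body>
    <!-- Load Puter.js, which exposes the global `puter` object -->
    <script src="https://js.puter.com/v2/"></script>
    <script>
        // Send a single prompt to Grok 4 Fast Non-Reasoning
        puter.ai.chat("Explain quantum computing in simple terms", {
            model: "x-ai/grok-4-fast-non-reasoning"
        }).then(response => {
            // Use textContent so the reply is shown as plain text, not parsed as HTML
            document.body.textContent = response.message.content;
        }).catch(error => {
            console.error("Chat request failed:", error);
        });
    </script>
</body>
</html>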
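
If you call the model from several places, it may help to wrap the request in a small async function. The following is a sketch, not an official pattern: `askGrok` is a hypothetical helper, and it assumes the reply text is available at `response.message.content`, as in the example above. The optional `chat` parameter is also an assumption added for illustration, so the helper can be exercised with a stub outside the browser.

```javascript
// Hypothetical helper (not a Puter.js API). In a page that loads Puter.js,
// omit `chat` and the helper falls back to puter.ai.chat.
async function askGrok(prompt, chat) {
    const chatFn = chat ?? ((...args) => puter.ai.chat(...args));
    try {
        const response = await chatFn(prompt, {
            model: "x-ai/grok-4-fast-non-reasoning"
        });
        // Same field that the example above reads.
        return response.message.content;
    } catch (error) {
        // Log the failure and return null so callers can decide what to show.
        console.error("Chat request failed:", error);
        return null;
    }
}
```

In a page that includes the Puter.js script tag, `askGrok("Summarize this paragraph").then(text => { ... })` would give you either the reply text or `null` if the request failed.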
