Gemini 3 Flash API

Access Gemini 3 Flash from Google using Puter.js AI API.

Get Started

google/gemini-3-flash-preview

Model Card

Gemini 3 Flash is Google's frontier intelligence model built for speed, combining Pro-grade reasoning with Flash-level latency at a fraction of the cost. It excels at agentic coding, complex analysis, and multimodal understanding with configurable thinking levels.

Context Window 1M

tokens

Max Output 66K

tokens

Input Cost $0.5

per million tokens

Output Cost $3

per million tokens

Input text, image, video, audio, pdf

modalities

Tool Use Yes

Knowledge Cutoff Jan 2025

Release Date Dec 17, 2025

API Usage Example

Add Gemini 3 Flash to your app with just a few lines of code.
No API keys, no backend, no configuration required.

// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';

puter.ai.chat("Explain quantum computing in simple terms", {
    model: "google/gemini-3-flash-preview"
}).then(response => {
    document.body.innerHTML = response.message.content;
});

<html>
<body>
    <script src="https://js.puter.com/v2/"></script>
    <script>
        puter.ai.chat("Explain quantum computing in simple terms", {
            model: "google/gemini-3-flash-preview"
        }).then(response => {
            document.body.innerHTML = response.message.content;
        });
    </script>
</body>
</html>

View full documentation →

More Models from Google

Chat

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite is Google's fastest and most cost-efficient model in the Gemini 3 series, opti...

Image

Gemini 3.1 Flash Image

Gemini 3.1 Flash Image (also known as Nano Banana 2) is Google DeepMind's latest state-of-the-art im...

Chat

Gemini 3.1 Pro

Gemini 3.1 Pro is Google's most advanced reasoning model, building on the Gemini 3 series with over ...

View all Google models →

Frequently Asked Questions

What is this Gemini 3 Flash API about?

The Gemini 3 Flash API gives you access to Google's chat model through Puter.js. With just a few lines of JavaScript, you can integrate Gemini 3 Flash into any web app or Node.js project — no API keys, no backend, and no configuration required.

Who created Gemini 3 Flash?

Gemini 3 Flash was created by Google and released on Dec 17, 2025.

What is the context window of Gemini 3 Flash?

Gemini 3 Flash supports a context window of 1M tokens. For reference, that is roughly equivalent to 2,097 pages of text.

What is the max output length of Gemini 3 Flash?

Gemini 3 Flash can generate up to 66K tokens in a single response.

What is the knowledge cutoff of Gemini 3 Flash?

Gemini 3 Flash has a knowledge cutoff date of Jan 2025. This means the model was trained on data available up to that date.

What types of input can Gemini 3 Flash process?

Gemini 3 Flash accepts the following input types: text, image, video, audio, pdf. It produces: text.

Does Gemini 3 Flash support tool use (function calling)?

Yes, Gemini 3 Flash supports tool use (function calling), allowing it to interact with external tools, APIs, and data sources as part of its response flow.

How much does it cost?

The Gemini 3 Flash API is available through the User-Pays Model. As a developer, you can add the Gemini 3 Flash API to your app for free — your users pay for their own AI costs directly.

	Price per 1M tokens
Input	$0.5
Output	$3

How do I access the Gemini 3 Flash API?

You can access the Gemini 3 Flash API with just a few lines of JavaScript — no API keys, no backend, and no configuration required. Include the Puter.js library in your project and start making calls right away. For more details, check out our documentation.

Does the Gemini 3 Flash API work with React / Vue / Vanilla JS / Node / etc.?

Yes — the Gemini 3 Flash API works with any JavaScript framework, Node.js, or plain HTML through Puter.js. Just include the library and start building. See the documentation for more details.

Get started with Puter.js

Add Gemini 3 Flash to your app without worrying about API keys or setup.

Read the Docs View Tutorials