google/gemini-3-flash-preview
Model Card
Gemini 3 Flash is Google's frontier intelligence model built for speed, combining Pro-grade reasoning with Flash-level latency at a fraction of the cost. It excels at agentic coding, complex analysis, and multimodal understanding with configurable thinking levels.
Context Window 1M
tokens
Max Output 66K
tokens
Input Cost $0.5
per million tokens
Output Cost $3
per million tokens
Input text, image, video, audio, pdf
modalities
Tool Use Yes
Knowledge Cutoff Jan 2025
Release Date Dec 17, 2025
API Usage Example
Add Gemini 3 Flash to your app with just a few lines of code.
No API keys, no backend, no configuration required.
// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';
puter.ai.chat("Explain quantum computing in simple terms", {
model: "google/gemini-3-flash-preview"
}).then(response => {
document.body.innerHTML = response.message.content;
});
<html>
<body>
<script src="https://js.puter.com/v2/"></script>
<script>
puter.ai.chat("Explain quantum computing in simple terms", {
model: "google/gemini-3-flash-preview"
}).then(response => {
document.body.innerHTML = response.message.content;
});
</script>
</body>
</html>
More Models from Google
Gemini 3.1 Flash Lite Preview
Gemini 3.1 Flash Lite is Google's fastest and most cost-efficient model in the Gemini 3 series, opti...
ImageGemini 3.1 Flash Image
Gemini 3.1 Flash Image (also known as Nano Banana 2) is Google DeepMind's latest state-of-the-art im...
ChatGemini 3.1 Pro
Gemini 3.1 Pro is Google's most advanced reasoning model, building on the Gemini 3 series with over ...
Frequently Asked Questions
The Gemini 3 Flash API gives you access to Google's chat model through Puter.js. With just a few lines of JavaScript, you can integrate Gemini 3 Flash into any web app or Node.js project — no API keys, no backend, and no configuration required.
Gemini 3 Flash was created by Google and released on Dec 17, 2025.
Gemini 3 Flash supports a context window of 1M tokens. For reference, that is roughly equivalent to 2,097 pages of text.
Gemini 3 Flash can generate up to 66K tokens in a single response.
Gemini 3 Flash has a knowledge cutoff date of Jan 2025. This means the model was trained on data available up to that date.
Gemini 3 Flash accepts the following input types: text, image, video, audio, pdf. It produces: text.
Yes, Gemini 3 Flash supports tool use (function calling), allowing it to interact with external tools, APIs, and data sources as part of its response flow.
| Price per 1M tokens | |
|---|---|
| Input | $0.5 |
| Output | $3 |
You can access the Gemini 3 Flash API with just a few lines of JavaScript — no API keys, no backend, and no configuration required. Include the Puter.js library in your project and start making calls right away. For more details, check out our documentation.
Yes — the Gemini 3 Flash API works with any JavaScript framework, Node.js, or plain HTML through Puter.js. Just include the library and start building. See the documentation for more details.
Get started with Puter.js
Add Gemini 3 Flash to your app without worrying about API keys or setup.
Read the Docs View Tutorials