Gemini 3.1 Flash Lite Is Now Available in Puter.js
On this page
Puter.js now supports Gemini 3.1 Flash Lite, Google's fastest and most cost-efficient model in the Gemini 3 series—optimized for high-volume, latency-sensitive production workloads.
What is Gemini 3.1 Flash Lite?
Gemini 3.1 Flash Lite is Google's latest cost-optimized model, launched on March 3, 2026. Priced at just $0.25/1M input tokens and $1.50/1M output tokens, it delivers frontier-class speed at a fraction of the cost. Compared to Gemini 2.5 Flash, it offers 2.5x faster time-to-first-token and a 45% boost in output speed. Key capabilities include:
- 1M token context window: Process large documents, datasets, and multimodal inputs in a single prompt
- 64K token output: Generate detailed responses for complex tasks
- Multimodal input: Accepts text, images, audio, video, and PDF inputs
- Cost-efficient at scale: Ideal for translation, classification, content moderation, and UI generation
Examples
Text classification
puter.ai.chat("Classify the following review as positive, negative, or neutral: 'Great product but shipping took forever'",
{ model: 'gemini-3.1-flash-lite-preview' }
);
Translation
puter.ai.chat("Translate the following to French, Spanish, and Japanese: 'Welcome to our platform'",
{ model: 'gemini-3.1-flash-lite-preview' }
);
Content moderation
puter.ai.chat("Does the following text contain harmful content? Respond with yes or no and explain: 'I love sunny days at the park'",
{ model: 'gemini-3.1-flash-lite-preview' }
);
Get Started Now
Just add one library to your project:
// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';
Or add one script tag to your HTML:
<script src="https://js.puter.com/v2/"></script>
No API keys and no infrastructure setup. Start building with Gemini 3.1 Flash Lite immediately.
Learn more:
Free, Serverless AI and Cloud
Start creating powerful web applications with Puter.js in seconds!
Get Started Now