Blog

Gemini 3.1 Flash Lite Is Now Available in Puter.js

On this page

Puter.js now supports Gemini 3.1 Flash Lite, Google's fastest and most cost-efficient model in the Gemini 3 series—optimized for high-volume, latency-sensitive production workloads.

What is Gemini 3.1 Flash Lite?

Gemini 3.1 Flash Lite is Google's latest cost-optimized model, launched on March 3, 2026. Priced at just $0.25/1M input tokens and $1.50/1M output tokens, it delivers frontier-class speed at a fraction of the cost. Compared to Gemini 2.5 Flash, it offers 2.5x faster time-to-first-token and a 45% boost in output speed. Key capabilities include:

  • 1M token context window: Process large documents, datasets, and multimodal inputs in a single prompt
  • 64K token output: Generate detailed responses for complex tasks
  • Multimodal input: Accepts text, images, audio, video, and PDF inputs
  • Cost-efficient at scale: Ideal for translation, classification, content moderation, and UI generation

Examples

Text classification

puter.ai.chat("Classify the following review as positive, negative, or neutral: 'Great product but shipping took forever'",
  { model: 'gemini-3.1-flash-lite-preview' }
);

Translation

puter.ai.chat("Translate the following to French, Spanish, and Japanese: 'Welcome to our platform'",
  { model: 'gemini-3.1-flash-lite-preview' }
);

Content moderation

puter.ai.chat("Does the following text contain harmful content? Respond with yes or no and explain: 'I love sunny days at the park'",
  { model: 'gemini-3.1-flash-lite-preview' }
);

Get Started Now

Just add one library to your project:

// npm install @heyputer/puter.js
import { puter } from '@heyputer/puter.js';

Or add one script tag to your HTML:

<script src="https://js.puter.com/v2/"></script>

No API keys and no infrastructure setup. Start building with Gemini 3.1 Flash Lite immediately.

Learn more:

Free, Serverless AI and Cloud

Start creating powerful web applications with Puter.js in seconds!

Get Started Now

Read the Docs Try the Playground