Gemini 3 Flash vs Gemini 3.1 Flash-Lite

Side-by-side API pricing comparison: which model gives you more for less?

Last verified Jun 2026 · Prices per 1M tokens

Cheaper Input
Gemini 3.1 Flash-Lite
$0.25 vs $0.5
Cheaper Output
Gemini 3.1 Flash-Lite
$1.5 vs $3
Max Savings
50%
by switching to Gemini 3.1 Flash-Lite
Save up to 50%
Switch from Gemini 3 Flash to Gemini 3.1 Flash-Lite · $0.25 input / $1.5 output per 1M tokens

Quick Comparison

FeatureGemini 3 FlashGemini 3.1 Flash-Lite
Provider Google Google
Tier Budget Budget
Input Price $0.5 $0.25
Output Price $3 $1.5
Context Window 1M 1M
Verified Jun 2026 Jun 2026

When to Use Each

💰 Cost-Sensitive Workloads

High-volume APIs, batch processing, and startups watching runway.

→ Use Gemini 3.1 Flash-Lite — 50% cheaper input, 50% cheaper output

🧠 Complex Reasoning

Tasks requiring advanced reasoning, code generation, or nuanced analysis.

→ Either model works — compare quality on your specific tasks

⚡ High-Throughput APIs

Real-time chatbots, streaming responses, and latency-sensitive apps.

→ Gemini 3.1 Flash-Lite for cost at scale, Gemini 3 Flash if quality matters more

🔬 Prototyping & Testing

Development, experimentation, and non-critical workloads.

→ Use Gemini 3.1 Flash-Lite — same context (1M), much cheaper

Track price changes for both models

APIpulse Pro monitors 49 models across 10 providers. Get alerts when Gemini 3 Flash or Gemini 3.1 Flash-Lite prices change.

Get Pro for $19 →

Frequently Asked Questions

Is Gemini 3.1 Flash-Lite cheaper than Gemini 3 Flash?

Yes. Gemini 3.1 Flash-Lite costs $0.25 input / $1.5 output per 1M tokens, while Gemini 3 Flash costs $0.5 input / $3 output. That's 50% cheaper on input and 50% cheaper on output.

How much can I save switching to Gemini 3.1 Flash-Lite?

For a typical workload (1M input + 500K output tokens/month), Gemini 3.1 Flash-Lite costs $1.00/month vs $2.00/month for Gemini 3 Flash. That's a savings of $1.00/month (50%).

Which should I choose: Gemini 3 Flash or Gemini 3.1 Flash-Lite?

Choose Gemini 3.1 Flash-Lite for cost efficiency. Choose Gemini 3 Flash for Google ecosystem benefits. Both have 1M context windows.