How much does GPT-5 cost per 1M tokens?

GPT-5 costs $1.25 per 1M input tokens and $10.00 per 1M output tokens. It has a 272K token context window and is OpenAI's mid-tier model.

How much does Gemini 3.5 Flash cost per 1M tokens?

Gemini 3.5 Flash costs $1.50 per 1M input tokens and $9.00 per 1M output tokens. It has a 1M token context window — 3.7x larger than GPT-5.

Is GPT-5 better than Gemini 3.5 Flash?

GPT-5 excels at complex reasoning and nuanced tasks. Gemini 3.5 Flash has a much larger context window (1M vs 272K) and is slightly cheaper on output tokens. The choice depends on whether you need reasoning depth (GPT-5) or context capacity (Gemini 3.5 Flash).

Which is cheaper: GPT-5 or Gemini 3.5 Flash?

GPT-5 is cheaper on input ($1.25 vs $1.50) while Gemini 3.5 Flash is cheaper on output ($9.00 vs $10.00). For output-heavy workloads, Gemini saves ~10%. For input-heavy workloads, GPT-5 saves ~17%.

🔥 Limited time: Pro lifetime access $29 — price goes up July 12 →

Mid-Tier June 26, 2026 7 min read

GPT-5 vs Gemini 3.5 Flash: Mid-Tier Showdown

💡 Key insight: These models are priced within 20% of each other, but Gemini 3.5 Flash has a 3.7x larger context window (1M vs 272K tokens). If you're processing long documents, Gemini wins. Use our free calculator to compare costs for your exact usage.

GPT-5 is OpenAI's workhorse. Gemini 3.5 Flash is Google's speed demon. Both are mid-tier models with nearly identical pricing — but very different strengths.

Pricing Comparison

GPT-5

OpenAI

$1.25 / $10.00

per 1M tokens (input / output)

Gemini 3.5 Flash

Google

$1.50 / $9.00

per 1M tokens (input / output)

Feature	GPT-5	Gemini 3.5 Flash
Input price (per 1M tokens)	$1.25 ✅	$1.50
Output price (per 1M tokens)	$10.00	$9.00 ✅
Context window	272K tokens	1M tokens ✅
Provider	OpenAI	Google
Best for	Complex reasoning, nuanced tasks	High-volume, context-heavy tasks
Function calling	Excellent	Excellent
Vision	Yes	Yes
Speed	Fast	Faster ✅

Cost Per Use Case

Use Case	Tokens (in/out)	GPT-5	Gemini 3.5 Flash	Cheaper
Chatbot response	2K / 500	$0.008	$0.008	Tie
Code generation	5K / 2K	$0.026	$0.026	Tie
Document summary	10K / 1K	$0.023	$0.024	GPT-5
Long document analysis	100K / 5K	$0.175	$0.195	GPT-5
RAG pipeline	15K / 3K	$0.049	$0.050	~Tie

The verdict on pricing: These models are so close in price that the difference is negligible for most workloads. The real differentiator is the context window — Gemini 3.5 Flash's 1M token window is 3.7x larger than GPT-5's 272K.

When to Choose GPT-5

            ✅ Choose GPT-5 when:
            Complex reasoning matters — Multi-step logic, mathematical proofs, nuanced analysis
You're in the OpenAI ecosystem — Using OpenAI SDK, Assistants API, etc.
Input-heavy workloads — GPT-5 is 17% cheaper on input tokens
You need reliable function calling — OpenAI's tool use is industry-leading
Context window is sufficient — 272K is enough for your use case

        

When to Choose Gemini 3.5 Flash

            ✅ Choose Gemini 3.5 Flash when:
            You need long context — 1M tokens handles entire codebases, long documents, multi-page PDFs
Speed matters — Gemini 3.5 Flash is optimized for low latency
High-volume processing — Content generation, data extraction, classification
You're in the Google ecosystem — Using Google Cloud, Vertex AI, etc.
Output-heavy workloads — Gemini is 10% cheaper on output tokens

        

The Context Window Factor

📐 Context window comparison

GPT-5

272K

tokens

Gemini 3.5 Flash

tokens

Gemini's context window is 3.7x larger — enough to process entire codebases or multi-hundred-page documents in a single request.

Real-World Cost Comparison

For a typical SaaS application processing 100K requests per month:

📊 Monthly cost for 100K requests (avg 3K input + 1K output per request)GPT-5
$1,375/mo
$16,500/year
Gemini 3.5 Flash
$1,350/mo
$16,200/year
Annual difference
~$300 (negligible)

The real question isn't price — it's capability. Choose based on whether you need GPT-5's reasoning depth or Gemini's context capacity.

Verdict

🏆 Winner: It depends on your needs

These models are so close in price that the choice comes down to capability, not cost.

Choose GPT-5 if you need the best reasoning and your context fits in 272K tokens.

Choose Gemini 3.5 Flash if you need 1M context or faster response times.

Want to see the full cost comparison across all 42 models?

Our free calculator shows you exactly how much you'll save by switching.

Compare All Models — Free →