Cheapest AI API in July 2026: Every Model Ranked by Cost

A ranked list of every AI API by cost, so you can find the cheapest model for your use case. Prices run from $0.075 to $30 per 1M input tokens. Full ranking with interactive calculator.

AI API pricing spans a massive range in July 2026. The cheapest model costs $0.075 per 1M input tokens, while the most expensive costs $30 per 1M input tokens and $180 per 1M output tokens. That is a 400x difference on input pricing alone. Choosing the right model for your workload is the single biggest cost lever for any AI application. This guide ranks every model by cost and helps you find the cheapest option for your specific use case.

Top 10 Cheapest AI Models (Ranked by Input Cost)

These are the cheapest models available today, ranked by input token cost. All prices are per 1M tokens.

| # | Model | Input / 1M | Output / 1M | Provider |
|---|-------|------------|-------------|----------|
| 1 | Gemini 2.5 Flash-Lite | $0.075 | $0.30 | Google |
| 2 | Mistral Small 4 | $0.10 | $0.30 | Mistral |
| 3 | Gemini 2.5 Flash-Lite | $0.10 | $0.40 | Google |
| 4 | DeepSeek V4 Flash | $0.14 | $0.28 | DeepSeek |
| 5 | GPT-4o mini | $0.15 | $0.60 | OpenAI |
| 6 | Llama 4 Scout | $0.18 | $0.59 | Meta |
| 7 | Llama 4 Maverick | $0.20 | $0.60 | Meta |
| 8 | GPT-5 mini | $0.25 | $2.00 | OpenAI |
| 9 | DeepSeek V4 Pro | $0.435 | $0.87 | DeepSeek |
| 10 | Mistral Large 3 | $0.50 | $1.50 | Mistral |

Key Insight

Gemini 2.5 Flash-Lite is the cheapest API overall at $0.075/1M input tokens. But cheapest input does not always mean cheapest total cost: DeepSeek V4 Flash at $0.14/$0.28 has cheaper output tokens, which makes it the better choice for generation-heavy workloads where output tokens dominate the bill. GPT-5 mini at $0.25/$2 costs more than either, but it is the cheapest option in this ranking for general-purpose use that needs stronger reasoning.
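To see where the crossover between the two models sits, here is a minimal sketch that computes the break-even output-to-input ratio from the list prices above. The function names are illustrative, and the calculation assumes plain list pricing with no caching or batch discounts.

```python
# List prices in USD per 1M tokens as (input, output), copied from the ranking above.
PRICES = {
    "Gemini 2.5 Flash-Lite": (0.075, 0.30),
    "DeepSeek V4 Flash": (0.14, 0.28),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a single request at list price."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def break_even_output_ratio(cheap_input: str, cheap_output: str) -> float:
    """Output tokens per input token at which both models cost the same."""
    a_in, a_out = PRICES[cheap_input]
    b_in, b_out = PRICES[cheap_output]
    # Solve a_in + r * a_out == b_in + r * b_out for r.
    return (b_in - a_in) / (a_out - b_out)


ratio = break_even_output_ratio("Gemini 2.5 Flash-Lite", "DeepSeek V4 Flash")
print(f"Break-even at {ratio:.2f} output tokens per input token")
```

At these prices, DeepSeek V4 Flash only comes out ahead when a request generates more than about 3.25 output tokens for every input token. Below that ratio, Gemini 2.5 Flash-Lite stays cheaper.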

Top 10 by Output Cost (Cheapest to Generate)

Output tokens are typically 2-8x more expensive than input tokens for the models listed in this guide. For workloads that generate long responses (chatbots, code generation, content writing), output cost matters more than input cost.

| # | Model | Output / 1M | Input / 1M | Provider |
|---|-------|-------------|------------|----------|
| 1 | DeepSeek V4 Flash | $0.28 | $0.14 | DeepSeek |
| 2 | Mistral Small 4 | $0.30 | $0.10 | Mistral |
| 3 | Gemini 2.5 Flash-Lite | $0.30 | $0.075 | Google |
| 4 | Gemini 2.5 Flash-Lite | $0.40 | $0.10 | Google |
| 5 | Llama 4 Scout | $0.59 | $0.18 | Meta |
| 6 | GPT-4o mini | $0.60 | $0.15 | OpenAI |
| 7 | Llama 4 Maverick | $0.60 | $0.20 | Meta |
| 8 | DeepSeek V4 Pro | $0.87 | $0.435 | DeepSeek |
| 9 | Mistral Large 3 | $1.50 | $0.50 | Mistral |
| 10 | GPT-5 mini | $2.00 | $0.25 | OpenAI |
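The gap between output and input pricing can be checked directly from the list prices. The sketch below computes the output-to-input multiplier for a sample of models from this guide. The selection of models and the helper name are illustrative choices, not a complete dataset.

```python
# List prices in USD per 1M tokens as (input, output), sampled from this guide.
PRICES = {
    "DeepSeek V4 Flash": (0.14, 0.28),
    "Mistral Small 4": (0.10, 0.30),
    "GPT-4o mini": (0.15, 0.60),
    "GPT-5 mini": (0.25, 2.00),
    "Claude Haiku 4.5": (1.00, 5.00),
    "GPT-5.5": (5.00, 30.00),
}


def output_multiplier(model: str) -> float:
    """How many times more an output token costs than an input token."""
    input_price, output_price = PRICES[model]
    return output_price / input_price


# Round to one decimal so float noise does not clutter the printout.
multipliers = {model: round(output_multiplier(model), 1) for model in PRICES}

for model, mult in sorted(multipliers.items(), key=lambda item: item[1]):
    print(f"{model}: output costs {mult}x input")
```

A model with a low multiplier, such as DeepSeek V4 Flash, is penalized less for long responses than one with a high multiplier, such as GPT-5 mini.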

Budget Tier: Models Under $1/1M Input

These models are ideal for high-volume, low-complexity tasks: classification, extraction, simple Q&A, FAQ chatbots, data labeling, and content moderation. They handle 80% of typical API workloads at a fraction of the cost.

| Model | Input / 1M | Output / 1M | Best For |
|-------|------------|-------------|----------|
| Gemini 2.5 Flash-Lite | $0.075 | $0.30 | Simple classification, extraction |
| Mistral Small 4 | $0.10 | $0.30 | General budget tasks |
| DeepSeek V4 Flash | $0.14 | $0.28 | High-volume generation |
| GPT-4o mini | $0.15 | $0.60 | OpenAI ecosystem integration |
| Llama 4 Scout | $0.18 | $0.59 | Self-hosted, on-prem |
| GPT-5 mini | $0.25 | $2.00 | Smart budget with reasoning |
| DeepSeek V4 Pro | $0.435 | $0.87 | Budget code generation |
| Mistral Large 3 | $0.50 | $1.50 | Mid-quality at budget price |

Budget Tier Insight

GPT-5 mini is the smart budget pick. At $0.25/$2, it offers significantly better reasoning than other budget models. For pure cost, Gemini Flash-Lite and Mistral Small 4 are cheaper, but GPT-5 mini handles complex tasks that trip up other budget models. If you need both cheap and capable, GPT-5 mini is the sweet spot.

Mid Tier: Models $1-5/1M Input

The mid tier is where production applications live. These models balance quality and cost, handling complex reasoning, code generation, and multi-step analysis without the premium price tag.

| Model | Input / 1M | Output / 1M | Best For |
|-------|------------|-------------|----------|
| Claude Haiku 4.5 | $1.00 | $5.00 | Fast, high-quality budget option from Anthropic |
| GPT-5 | $1.25 | $10.00 | Best mid-tier value overall |
| Gemini 2.5 Pro | $1.25 | $10.00 | Google ecosystem, long context |
| Grok 4.3 | $1.25 | $2.50 | Fast inference, social data |
| GPT-5.3 Codex | $1.75 | $14.00 | Code generation specialist |
| Gemini 3.1 Pro | $2.00 | $12.00 | Latest Google premium |
| GPT-4o | $2.50 | $10.00 | Vision, multimodal |
| Claude Sonnet 5 | $3.00 | $15.00 | High-quality analysis, writing |
| Claude Sonnet 4.6 | $3.00 | $15.00 | Proven Anthropic workhorse |

Premium Tier: Models $5+/1M Input

Premium models are for the hardest problems: complex multi-step reasoning, advanced coding, creative writing, research analysis, and tasks requiring deep domain knowledge. Use them sparingly; most workloads do not need this level.

| Model | Input / 1M | Output / 1M | Best For |
|-------|------------|-------------|----------|
| GPT-5.5 | $5.00 | $30.00 | Complex reasoning, multi-file code |
| Claude Opus 4.8 | $5.00 | $25.00 | Deep analysis, long-form writing |
| Claude Opus 4.7 | $5.00 | $25.00 | Proven premium option from Anthropic |
| Claude Fable 5 | $5.00 | $25.00 | Creative tasks, storytelling |
| Claude Mythos 5 | $5.00 | $25.00 | Abstract reasoning, research |
| GPT-5.5 Pro | $30.00 | $180.00 | Extreme reasoning (very expensive) |

Use Case Recommendations

Not sure which tier to pick? Here is a recommendation based on your specific use case:

Chatbots and Customer Support

Budget Tier Recommended

Customer support bots handle simple questions 80% of the time. Use GPT-5 mini ($0.25/$2) for routine questions and route complex issues to a mid-tier model. This hybrid approach costs 80-90% less than using a premium model for everything.
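As a rough check on the hybrid approach, the sketch below compares a routed setup against sending everything to a premium model. The workload numbers (1,000 input and 500 output tokens per request), the choice of GPT-5 as the escalation model and GPT-5.5 as the premium baseline, and the function names are all assumptions made for illustration.

```python
# List prices in USD per 1M tokens as (input, output), from the tables in this guide.
PRICES = {
    "GPT-5 mini": (0.25, 2.00),
    "GPT-5": (1.25, 10.00),
    "GPT-5.5": (5.00, 30.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a single request at list price."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def hybrid_cost(simple_share: float, input_tokens: int, output_tokens: int,
                budget_model: str = "GPT-5 mini",
                escalation_model: str = "GPT-5") -> float:
    """Average cost per request when simple_share of traffic stays on the budget model."""
    cheap = request_cost(budget_model, input_tokens, output_tokens)
    escalated = request_cost(escalation_model, input_tokens, output_tokens)
    return simple_share * cheap + (1 - simple_share) * escalated


# Assumed workload: 80% simple questions, 1,000 input and 500 output tokens each.
hybrid = hybrid_cost(0.8, 1000, 500)
premium = request_cost("GPT-5.5", 1000, 500)
savings = 1 - hybrid / premium

print(f"Hybrid: ${hybrid:.5f}/request, premium: ${premium:.5f}/request, savings: {savings:.0%}")
```

Under these assumptions the hybrid setup costs about 89% less per request. The sketch treats simple and complex requests as the same size, so real savings will shift if escalated conversations run longer.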

Code Generation and Review

Mid Tier Recommended

Code generation needs solid reasoning but rarely a premium model. GPT-5 ($1.25/$10) handles most coding tasks well. For complex multi-file refactoring, upgrade to GPT-5.3 Codex ($1.75/$14) or Opus 4.8 ($5/$25) only when needed.

Content Writing and Marketing

Mid Tier Recommended

Blog posts, emails, and marketing copy work well with Claude Sonnet 5 ($3/$15) or GPT-5 ($1.25/$10). For long-form articles or creative content that needs a distinctive voice, Opus 4.8 ($5/$25) produces noticeably better output.

Data Extraction and Classification

Budget Tier Recommended

Structured extraction and classification are where budget models shine. Mistral Small 4 ($0.10/$0.30) or GPT-4o mini ($0.15/$0.60) handle these tasks at 95%+ accuracy for a fraction of premium costs. Save your budget for tasks that actually need intelligence.

Complex Reasoning and Research

Premium Tier Required

Multi-step reasoning, mathematical proofs, research analysis, and complex decision-making require premium models. GPT-5.5 ($5/$30) or Claude Opus 4.8 ($5/$25) are the only options that handle these tasks reliably. There is no budget shortcut here.

Translation and Summarization

Budget Tier Recommended

Translation and summarization are well-solved problems. GPT-4o mini ($0.15/$0.60) or DeepSeek V4 Flash ($0.14/$0.28) produce quality comparable to premium models for these specific tasks. Premium models add marginal quality at roughly 30-100x the list price.

Full Cost Comparison Table

Every model in one table, sorted by input cost. This is your complete reference for AI API pricing in July 2026.

| Model | Provider | Input / 1M | Output / 1M | Tier |
|-------|----------|------------|-------------|------|
| Gemini 2.5 Flash-Lite | Google | $0.075 | $0.30 | Budget |
| Mistral Small 4 | Mistral | $0.10 | $0.30 | Budget |
| Gemini 2.5 Flash-Lite | Google | $0.10 | $0.40 | Budget |
| DeepSeek V4 Flash | DeepSeek | $0.14 | $0.28 | Budget |
| GPT-4o mini | OpenAI | $0.15 | $0.60 | Budget |
| Llama 4 Scout | Meta | $0.18 | $0.59 | Budget |
| Llama 4 Maverick | Meta | $0.20 | $0.60 | Budget |
| GPT-5 mini | OpenAI | $0.25 | $2.00 | Budget |
| DeepSeek V4 Pro | DeepSeek | $0.435 | $0.87 | Budget |
| Mistral Large 3 | Mistral | $0.50 | $1.50 | Budget |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | Mid |
| GPT-5 | OpenAI | $1.25 | $10.00 | Mid |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | Mid |
| Grok 4.3 | xAI | $1.25 | $2.50 | Mid |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | Mid |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 | Mid |
| GPT-4o | OpenAI | $2.50 | $10.00 | Mid |
| Command R+ | Cohere | $2.50 | $10.00 | Mid |
| Claude Sonnet 5 | Anthropic | $3.00 | $15.00 | Mid |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | Mid |
| GPT-5.5 | OpenAI | $5.00 | $30.00 | Premium |
| Claude Opus 4.8 | Anthropic | $5.00 | $25.00 | Premium |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | Premium |
| Claude Fable 5 | Anthropic | $5.00 | $25.00 | Premium |
| Claude Mythos 5 | Anthropic | $5.00 | $25.00 | Premium |
| GPT-5.5 Pro | OpenAI | $30.00 | $180.00 | Premium |
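The Tier column follows the input price thresholds used throughout this guide. A minimal sketch of that rule is below. The function name is hypothetical, and the boundary handling ($1.00 counts as Mid, $5.00 counts as Premium) is inferred from the rows in the table rather than stated anywhere explicitly.

```python
def tier_for(input_price_per_1m: float) -> str:
    """Map an input price in USD per 1M tokens to this guide's tier labels."""
    if input_price_per_1m < 1.0:
        return "Budget"
    if input_price_per_1m < 5.0:
        return "Mid"
    return "Premium"


# A few rows from the table above as (model, input price).
samples = [
    ("Mistral Large 3", 0.50),
    ("Claude Haiku 4.5", 1.00),
    ("Claude Sonnet 5", 3.00),
    ("GPT-5.5", 5.00),
    ("GPT-5.5 Pro", 30.00),
]

for model, price in samples:
    print(f"{model}: {tier_for(price)}")
```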

Interactive Cost Calculator

Enter your tokens and requests to see exactly what each model costs for your workload.
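If you prefer to run the numbers yourself, here is a minimal sketch of the same calculation in Python. It is not the code behind the calculator. The `Workload` class, the function names, the 30-day month, and the sample workload are assumptions for illustration, and the prices are list prices copied from the tables in this guide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Workload:
    input_tokens: int      # average input tokens per request
    output_tokens: int     # average output tokens per request
    requests_per_day: int


# List prices in USD per 1M tokens as (input, output), a subset of the full table.
PRICES = {
    "Gemini 2.5 Flash-Lite": (0.075, 0.30),
    "DeepSeek V4 Flash": (0.14, 0.28),
    "GPT-5 mini": (0.25, 2.00),
    "GPT-5": (1.25, 10.00),
    "Claude Opus 4.8": (5.00, 25.00),
}


def daily_cost(model: str, workload: Workload) -> float:
    """Cost in USD per day for the given workload at list price."""
    input_price, output_price = PRICES[model]
    per_request = (workload.input_tokens * input_price
                   + workload.output_tokens * output_price) / 1_000_000
    return per_request * workload.requests_per_day


def monthly_costs(workload: Workload, days: int = 30) -> list[tuple[str, float]]:
    """Monthly cost per model, cheapest first."""
    costs = [(model, daily_cost(model, workload) * days) for model in PRICES]
    return sorted(costs, key=lambda pair: pair[1])


# Assumed workload: 2,000 input and 500 output tokens, 10,000 requests per day.
workload = Workload(input_tokens=2000, output_tokens=500, requests_per_day=10_000)

for model, cost in monthly_costs(workload):
    print(f"{model:<24} ${cost:>10,.2f}/month")
```

Swap in your own token counts and request volume to compare models. The ranking can change with the input-to-output mix, so it is worth testing more than one workload shape.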

Key Takeaways

  1. Cheapest overall: Gemini 2.5 Flash-Lite at $0.075/$0.30 per 1M tokens. For output-heavy work, DeepSeek V4 Flash at $0.14/$0.28 is cheapest.
  2. Cheapest from OpenAI: GPT-4o mini at $0.15/$0.60. For smarter budget tasks, GPT-5 mini at $0.25/$2 is the sweet spot.
  3. Cheapest from Anthropic: Claude Haiku 4.5 at $1/$5. No budget-tier option — Anthropic focuses on quality over price.
  4. Best mid-tier value: GPT-5 at $1.25/$10 — near-premium quality at mid-tier pricing. Handles 90% of production workloads.
  5. Cheapest premium: Claude Opus 4.8 at $5/$25 — $5 cheaper per 1M output tokens than GPT-5.5. With 90% caching, it is the premium value pick.
  6. Avoid GPT-5.5 Pro unless necessary: At $30/$180, it costs 6 to 7 times as much as the other premium models. Only use it for extreme reasoning tasks.
  7. Match model to task: Classification and extraction need budget models. Code and analysis need mid-tier. Complex reasoning needs premium. Most developers overpay by using premium for budget-level tasks.
