Top 10 Cheapest AI APIs in 2026 (With Live Pricing Badges)

Updated June 2026 · 6 min read · 10 models ranked

AI API costs vary by 400x between the cheapest and most expensive models. If you're building on a budget, picking the right model can save you thousands per month. Here are the 10 cheapest AI APIs in 2026 — with live pricing badges you can embed in your README or docs.

TL;DR: The cheapest AI API is Gemini 2.0 Flash Lite at $0.075/$0.30 per 1M tokens. For quality-sensitive tasks, DeepSeek V4 Pro ($0.44/$0.87) is the best value. Every model below has a live pricing badge you can embed.

The 10 Cheapest AI APIs, Ranked

All prices are per 1M tokens, verified June 2026. The "cost per 1K requests" column assumes a typical 500-token input / 500-token output request.

# Model Provider Input Output Context Cost / 1K req
1 Gemini 2.0 Flash Lite Google $0.075 $0.30 1M $0.19
2 GPT-oss 20B OpenAI $0.08 $0.35 128K $0.22
3 Llama 3.1 8B Meta (Together.ai) $0.10 $0.10 128K $0.10
4 Gemini 2.0 Flash Google $0.10 $0.40 1M $0.25
5 Llama 4 Scout Meta (Together.ai) $0.11 $0.34 10M $0.23
6 DeepSeek V4 Flash DeepSeek $0.14 $0.28 1M $0.21
7 GPT-oss 120B OpenAI $0.15 $0.60 128K $0.38
8 Mistral Small 4 Mistral $0.15 $0.60 128K $0.38
9 GPT-5 mini OpenAI $0.25 $2.00 272K $1.13
10 DeepSeek V4 Pro DeepSeek $0.44 $0.87 1M $0.66

Live Pricing Badges — Embed in Your README

Each badge below shows live pricing that auto-updates when providers change their rates. Click any code block to copy the embed code. See all 34 badges on our Pricing Badges Gallery.

Gemini Flash Lite pricing

#1 — Gemini 2.0 Flash Lite

The absolute cheapest AI API. Great for classification, extraction, and simple Q&A.

![Gemini Flash Lite](https://getapipulse.com/api/badge?model=google-flash-lite)
GPT-oss 20B pricing

#2 — GPT-oss 20B

OpenAI's open-source budget model. Surprisingly capable for its price.

![GPT-oss 20B](https://getapipulse.com/api/badge?model=openai-gpt-oss-20b)
Llama 3.1 8B pricing

#3 — Llama 3.1 8B

Meta's smallest model via Together.ai. Equal input/output pricing makes cost predictable.

![Llama 3.1 8B](https://getapipulse.com/api/badge?model=llama-3.1-8b)
Gemini Flash pricing

#4 — Gemini 2.0 Flash

1M context window at $0.10 input. Best budget option for long-document processing.

![Gemini Flash](https://getapipulse.com/api/badge?model=google-flash)
Llama 4 Scout pricing

#5 — Llama 4 Scout

10M context window — the largest available. Dedicated inference via Together.ai.

![Llama 4 Scout](https://getapipulse.com/api/badge?model=llama-4-scout)
DeepSeek V4 Flash pricing

#6 — DeepSeek V4 Flash

DeepSeek's speed-optimized model. Best balance of cost and quality for most tasks.

![DeepSeek V4 Flash](https://getapipulse.com/api/badge?model=deepseek-v4-flash)
GPT-oss 120B pricing

#7 — GPT-oss 120B

OpenAI's larger open-source model. Strong reasoning at budget prices.

![GPT-oss 120B](https://getapipulse.com/api/badge?model=openai-gpt-oss-120b)
Mistral Small pricing

#8 — Mistral Small 4

Excellent for code completion and structured output. Strong European option.

![Mistral Small](https://getapipulse.com/api/badge?model=mistral-small)
GPT-5 mini pricing

#9 — GPT-5 mini

OpenAI's mid-budget option. 272K context with strong general capabilities.

![GPT-5 mini](https://getapipulse.com/api/badge?model=openai-gpt5-mini)
DeepSeek V4 Pro pricing

#10 — DeepSeek V4 Pro

The best price-to-quality ratio for complex tasks. 1M context, excellent at code and reasoning.

![DeepSeek V4 Pro](https://getapipulse.com/api/badge?model=deepseek-v4-pro)

How to Pick the Right Budget Model

Not every cheap model fits every task. Here's a quick guide:

The Massive Cost Gap: Budget vs Premium

To put these prices in perspective: the cheapest model (Gemini Flash Lite at $0.075/$0.30) is 400x cheaper on input and 600x cheaper on output than the most expensive model (GPT-5.5 Pro at $30/$180). Even the #10 cheapest model (DeepSeek V4 Pro at $0.44/$0.87) is 68x cheaper than premium tier.

For most startups and side projects, a top-10 budget model will handle 90%+ of your AI workloads. Reserve premium models for the 10% of tasks that truly need them.

Embed All 10 Badges at Once

Want to show all 10 prices in your project docs? Use our badges gallery to grab embed code for each model, or use the API directly:

# Add to your README.md
## AI API Pricing
[![Gemini Flash Lite](https://getapipulse.com/api/badge?model=google-flash-lite)](https://getapipulse.com)
[![GPT-oss 20B](https://getapipulse.com/api/badge?model=openai-gpt-oss-20b)](https://getapipulse.com)
[![Llama 3.1 8B](https://getapipulse.com/api/badge?model=llama-3.1-8b)](https://getapipulse.com)
[![DeepSeek V4 Flash](https://getapipulse.com/api/badge?model=deepseek-v4-flash)](https://getapipulse.com)
[![DeepSeek V4 Pro](https://getapipulse.com/api/badge?model=deepseek-v4-pro)](https://getapipulse.com)

# Full list of 34 badges: https://getapipulse.com/badges.html

Calculate Your Exact Costs

Enter your actual token usage and see costs across all 34 models. Find the cheapest option for your specific workload.

Open Calculator — Free

Key Takeaways

Methodology

All prices sourced directly from provider pricing pages, verified June 1, 2026. Prices are per 1M tokens. Cost per 1K requests assumes 500-token input + 500-token output per request. We track 34 models across 10 providers. Data is updated monthly. See pricing changelog →

Share on X Share on LinkedIn Share on Reddit

Embed live pricing badges: APIpulse Pricing Badges Gallery — 34 models, copy-paste Markdown/HTML, auto-updating.

Try it free: APIpulse Cost Calculator — estimate your monthly spend across 34 models and 10 providers in 30 seconds.