Pricing

All AI API Models Under $1/M in June 2026 — Complete Price List

15 models from $0.075 to $0.95 per 1M input tokens. The complete list of every affordable AI API available today.

Published Jun 10, 2026 · 10 min read · Updated with latest pricing

AI just got incredibly cheap. In June 2026, there are 15 API models that cost under $1 per million input tokens — down from just 3 a year ago. The cheapest starts at $0.075/M, which means you can process 13 million tokens for a single dollar. This is the complete, sorted list of every model under $1/M.

We track 15 models across 8 providers that fall under the $1/M threshold. All prices are per 1M tokens, verified as of June 10, 2026.

Complete Ranking: All 15 Models Under $1/M

Every AI API model priced under $1/M input tokens, ranked by total cost (input + output) per 1M tokens. Sorted cheapest first.

# Model Input Output Total Context Provider
1 Llama 3.1 8B $0.10 $0.10 $0.20 128K Meta
2 Gemini 2.0 Flash Lite $0.075 $0.30 $0.375 1M Google
3 GPT-oss 20B $0.08 $0.35 $0.43 128K OpenAI
4 DeepSeek V4 Flash $0.14 $0.28 $0.42 1M DeepSeek
5 Gemini 2.0 Flash $0.10 $0.40 $0.50 1M Google
6 DeepSeek V3.2 $0.23 $0.34 $0.57 128K DeepSeek
7 GPT-oss 120B $0.15 $0.60 $0.75 128K OpenAI
8 GPT-4o mini $0.15 $0.60 $0.75 128K OpenAI
9 Mistral Small 4 $0.15 $0.60 $0.75 128K Mistral
10 Llama 4 Scout $0.18 $0.59 $0.77 1M Meta
11 Llama 4 Maverick $0.27 $0.85 $1.12 1M Meta
12 DeepSeek V4 Pro $0.435 $0.87 $1.305 1M DeepSeek
13 Mistral Large 3 $0.50 $1.50 $2.00 262K Mistral
14 Command R $0.50 $1.50 $2.00 128K Cohere
15 Kimi K2.6 $0.95 $4.00 $4.95 256K Moonshot

Notice something interesting: the cheapest model by input price isn't the cheapest by total cost. Llama 3.1 8B costs $0.10/M input but only $0.10/M output, giving it the lowest total at $0.20. Gemini Flash Lite has the lowest input at $0.075 but costs 3x more on output.

Lowest Total Cost
$0.20
Llama 3.1 8B ($0.10 in / $0.10 out)
Lowest Input Price
$0.075
Gemini 2.0 Flash Lite ($0.075 in / $0.30 out)

Calculate Your Exact Costs

Enter your token usage and see exactly how much each of these 15 models costs for your workload.

Open Cost Calculator →

Price Tier Breakdown

These 15 models split into three clear tiers based on input pricing. Here's what you get at each level.

Ultra-Cheap

Under $0.15/M input

  • Gemini 2.0 Flash Lite — $0.075/$0.30 · 1M context · Google's cheapest model
  • GPT-oss 20B — $0.08/$0.35 · 128K context · OpenAI's open-source entry
  • Llama 3.1 8B — $0.10/$0.10 · 128K context · Open source, lowest total cost
  • Gemini 2.0 Flash — $0.10/$0.40 · 1M context · Google's balanced budget option
  • DeepSeek V4 Flash — $0.14/$0.28 · 1M context · Best budget coding model

Best for: high-volume classification, simple Q&A, data extraction, embedding pipelines

Cheap

$0.15 — $0.30/M input

  • GPT-oss 120B — $0.15/$0.60 · 128K context · Strong general-purpose
  • GPT-4o mini — $0.15/$0.60 · 128K context · OpenAI's budget workhorse
  • Mistral Small 4 — $0.15/$0.60 · 128K context · EU data sovereignty
  • Llama 4 Scout — $0.18/$0.59 · 1M context · Open source, MIT license
  • DeepSeek V3.2 — $0.23/$0.34 · 128K context · Proven production model
  • Llama 4 Maverick — $0.27/$0.85 · 1M context · Open source flagship

Best for: chatbots, content generation, RAG pipelines, code assistance

Budget-Friendly

$0.40 — $1.00/M input

  • DeepSeek V4 Pro — $0.435/$0.87 · 1M context · Best value for complex tasks
  • Mistral Large 3 — $0.50/$1.50 · 262K context · Strong at RAG and retrieval
  • Command R — $0.50/$1.50 · 128K context · Cohere's enterprise RAG model
  • Kimi K2.6 — $0.95/$4.00 · 256K context · Excellent reasoning capabilities

Best for: code generation, complex analysis, RAG, nuanced writing

Use Case Recommendations

Different tasks need different models. Here's the best sub-$1 model for each major use case.

💬

Chatbot

DeepSeek V4 Flash

$0.14/$0.28 — cheapest model that handles multi-turn conversations naturally. Used in production by thousands of apps.

💻

Code Generation

DeepSeek V4 Pro

$0.435/$0.87 — outperforms GPT-4o on coding benchmarks at 80% less cost. Best value coding model under $1.

📚

RAG Pipeline

Command R

$0.50/$1.50 — purpose-built for retrieval-augmented generation with strong context following and factual accuracy.

✍️

Content Writing

Kimi K2.6

$0.95/$4.00 — excellent reasoning and long-form generation. The most capable writing model under $1/M input.

📊

Data Extraction

Llama 3.1 8B

$0.10/$0.10 — lowest total cost at $0.20/M. Perfect for high-volume structured extraction tasks.

🌐

Multilingual

Mistral Small 4

$0.15/$0.60 — strong multilingual support with EU data sovereignty. Handles 30+ languages well.

🔒

No Vendor Lock-in

Llama 4 Scout

$0.18/$0.59 — open source MIT license, self-hostable, 1M context window. Full control over your stack.

Highest Volume

Gemini 2.0 Flash Lite

$0.075/$0.30 — cheapest input price of any model. When you need to process millions of tokens daily.

Track Every Dollar with APIpulse Pro

Set cost alerts, compare models in real-time, and optimize your API spend across all 15 budget models. $29/month.

Get APIpulse Pro →

Provider Breakdown

Eight providers offer models under $1/M input in June 2026. Here's how they compare.

Compare All 53 Models Side by Side

Our comparison tool lets you filter by price, context window, provider, and capabilities across every tracked model.

Open Comparison Tool →

The Bottom Line

The era of expensive AI is over. With 15 models under $1/M input tokens, every startup and indie developer can afford production-quality AI. The cheapest option, Gemini 2.0 Flash Lite at $0.075/M, lets you process 13 million tokens for a dollar.

Here's the quick decision tree for choosing among these 15 models:

Use the APIpulse cost calculator to model your exact usage and find the cheapest model that meets your quality bar.

Stay ahead of API pricing changes

Get notified when providers change prices, deprecate models, or launch new ones. Join 2,400+ developers.