AI API Cost per Request: Quick Reference Table

How much does a single API call actually cost? We calculated it for all 33 models across 10 providers at four common request sizes. Bookmark this page.

Assumption: Each request sends three times as many input tokens as output tokens (a 3:1 ratio, typical for chat, RAG, and code assistant workloads). Costs are per single request. All prices were verified in May 2026.
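
As a rough sketch of the arithmetic behind each cell: the request sizes appear to be total tokens (input plus output), split 3:1. The function name and the per-million-token prices below are illustrative assumptions, not values taken from this page. Prices of $2.50 input and $10.00 output happen to reproduce the GPT-4o row.

```python
def cost_per_request(total_tokens, input_price_per_m, output_price_per_m, input_share=0.75):
    """Cost in USD of one request, given prices in USD per million tokens.

    input_share=0.75 encodes the 3:1 input-to-output assumption.
    """
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    return (input_tokens * input_price_per_m + output_tokens * output_price_per_m) / 1_000_000


# Assumed prices (USD per million tokens), for illustration only.
INPUT_PRICE = 2.50
OUTPUT_PRICE = 10.00

for size in (100, 500, 1_000, 5_000):
    print(f"{size:>5} tok: ${cost_per_request(size, INPUT_PRICE, OUTPUT_PRICE):.6f}")
```

Because the cost is linear in the token count, the 5K column is always five times the 1K column for the same model.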

All 33 Models — Cost per Request

Sorted cheapest to most expensive. At 1K tokens, costs range from $0.000100 (Llama 3.1 8B) to $0.067500 (GPT-5.5 Pro) — a 675x gap.

Model | Tier | Provider | 100 tok | 500 tok | 1K tok | 5K tok
Llama 3.1 8B | Budget | Meta (Together.ai) | $0.000010 | $0.000050 | $0.000100 | $0.000500
GPT-oss 20B | Budget | OpenAI | $0.000015 | $0.000074 | $0.000148 | $0.000737
Llama 4 Scout | Budget | Meta (Together.ai) | $0.000017 | $0.000084 | $0.000168 | $0.000838
Gemini 2.0 Flash | Budget | Google | $0.000017 | $0.000087 | $0.000175 | $0.000875
DeepSeek V4 Flash | Budget | DeepSeek | $0.000018 | $0.000087 | $0.000175 | $0.000875
Mistral Small 4 | Budget | Mistral | $0.000026 | $0.000131 | $0.000262 | $0.001313
GPT-4o mini | Budget | OpenAI | $0.000026 | $0.000131 | $0.000262 | $0.001313
GPT-oss 120B | Budget | OpenAI | $0.000026 | $0.000131 | $0.000262 | $0.001313
Llama 4 Maverick | Budget | Meta (Together.ai) | $0.000030 | $0.000150 | $0.000300 | $0.001500
DeepSeek V3 | Budget | DeepSeek | $0.000048 | $0.000239 | $0.000478 | $0.002387
DeepSeek V4 Pro | Budget | DeepSeek | $0.000055 | $0.000274 | $0.000548 | $0.002737
GPT-5 mini | Budget | OpenAI | $0.000070 | $0.000350 | $0.000700 | $0.003500
Command R | Budget | Cohere | $0.000075 | $0.000375 | $0.000750 | $0.003750
Mistral Large 3 | Budget | Mistral | $0.000075 | $0.000375 | $0.000750 | $0.003750
Llama 3.1 70B | Mid | Meta (Together.ai) | $0.000088 | $0.000440 | $0.000880 | $0.004400
Claude Haiku 4.5 | Budget | Anthropic | $0.000160 | $0.000800 | $0.001600 | $0.008000
Kimi K2.6 | Budget | Moonshot | $0.000161 | $0.000806 | $0.001613 | $0.008063
Gemini 2.5 Pro | Mid | Google | $0.000344 | $0.001719 | $0.003438 | $0.017188
Grok 3 Mini | Mid | xAI | $0.000350 | $0.001750 | $0.003500 | $0.017500
Jamba 1.5 Large | Mid | AI21 | $0.000350 | $0.001750 | $0.003500 | $0.017500
Command R+ | Mid | Cohere | $0.000438 | $0.002188 | $0.004375 | $0.021875
GPT-4o | Mid | OpenAI | $0.000438 | $0.002188 | $0.004375 | $0.021875
Gemini 3.1 Pro | Mid | Google | $0.000450 | $0.002250 | $0.004500 | $0.022500
GPT-5.3 Codex | Mid | OpenAI | $0.000481 | $0.002406 | $0.004812 | $0.024063
Claude Sonnet 4 | Mid | Anthropic | $0.000600 | $0.003000 | $0.006000 | $0.030000
Claude Sonnet 4.6 | Mid | Anthropic | $0.000600 | $0.003000 | $0.006000 | $0.030000
Claude Opus 4.7 | Premium | Anthropic | $0.001000 | $0.005000 | $0.010000 | $0.050000
GPT-5.5 | Premium | OpenAI | $0.001125 | $0.005625 | $0.011250 | $0.056250
GPT-5 | Premium | OpenAI | $0.001500 | $0.007500 | $0.015000 | $0.075000
Claude 4 Opus | Premium | Anthropic | $0.003000 | $0.015000 | $0.030000 | $0.150000
Grok 3 | Premium | xAI | $0.006000 | $0.030000 | $0.060000 | $0.300000
GPT-5.5 Pro | Premium | OpenAI | $0.006750 | $0.033750 | $0.067500 | $0.337500

Key Takeaways

The 675x Gap

For a 1K-token request, the most expensive model (GPT-5.5 Pro at $0.067500/request) costs 675 times as much as the cheapest (Llama 3.1 8B at $0.000100/request). Because per-request cost scales linearly with token count, the gap is the same 675x at every request size, including 5K tokens.
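
A quick way to check the gap is to divide the two rows column by column. This is a minimal sketch using only the figures from the table above. The variable names are illustrative.

```python
# Per-request costs in USD, copied from the table rows for the
# cheapest and most expensive models at each request size.
sizes = ["100 tok", "500 tok", "1K tok", "5K tok"]
llama_3_1_8b = [0.000010, 0.000050, 0.000100, 0.000500]
gpt_5_5_pro = [0.006750, 0.033750, 0.067500, 0.337500]

# Ratio of most expensive to cheapest at each size, rounded to a whole number.
ratios = [round(high / low) for low, high in zip(llama_3_1_8b, gpt_5_5_pro)]

for size, ratio in zip(sizes, ratios):
    print(f"{size}: {ratio}x")
```

The ratio comes out identical in every column, which is what a linear per-token price implies.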

How to Use This Table

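These costs assume a 3:1 input-to-output token ratio. Your actual costs depend on your specific workload, in particular on how your tokens split between input and output, since output tokens are typically priced higher than input tokens.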
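
To see how much the ratio matters, the sketch below compares the table's 3:1 split with an even 1:1 split for the same 1K-token request, then scales both to a monthly volume. The prices ($3.00 input, $15.00 output per million tokens) and the request volume are hypothetical values chosen for illustration. At 3:1 they happen to reproduce the Claude Sonnet 4 row.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Cost in USD of one request, given prices in USD per million tokens."""
    return (input_tokens * input_price_per_m + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical prices (USD per million tokens) and monthly volume.
INPUT_PRICE = 3.00
OUTPUT_PRICE = 15.00
REQUESTS_PER_MONTH = 100_000

# Same 1K total tokens, two different input/output splits.
table_mix = request_cost(750, 250, INPUT_PRICE, OUTPUT_PRICE)  # 3:1, as in the table
even_mix = request_cost(500, 500, INPUT_PRICE, OUTPUT_PRICE)   # 1:1, output-heavy workload

for label, cost in (("3:1", table_mix), ("1:1", even_mix)):
    print(f"{label}: ${cost:.6f}/request, ${cost * REQUESTS_PER_MONTH:,.2f}/month")
```

Under these assumed prices, moving from 3:1 to 1:1 raises the per-request cost by half, so a table value should be treated as a starting estimate for output-heavy workloads.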

For exact calculations with your token ratios, use our interactive calculator or token estimator.