Live AI API Pricing

Every model. Every provider. Sort by cost and find the cheapest option for your workload.

Last verified: · 42 models · 10 providers

Model · Provider · Tier · Input/1M · Output/1M · Context

Find your cheapest alternative in 30 seconds

Enter your current model and usage. APIpulse calculates exactly how much you'd save by switching.

Calculate My Savings →

Free · No signup · Instant results

📊 Want these prices on your site? Embed a live widget →

Frequently Asked Questions

How often is this pricing data updated?

We verify pricing data regularly against official provider documentation. Prices shown reflect the latest publicly listed rates from each provider's API pricing page. The last-verified date is shown near the top of this page.

Which AI model is cheapest right now?

The cheapest models change frequently. Use the table above: sort by the Input/1M or Output/1M column to find the current cheapest option. Budget-tier models like Gemini Flash Lite, DeepSeek V4 Flash, and Llama 4 Scout typically offer the lowest per-token costs.
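If you keep your own copy of the price list, the same sort is easy to reproduce. The sketch below is only an illustration: the ModelPrice type, the cheapest_by helper, and every model name and price in it are hypothetical placeholders, not APIpulse data or real provider rates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    name: str
    input_per_1m: float   # USD per 1M input tokens (placeholder value)
    output_per_1m: float  # USD per 1M output tokens (placeholder value)


# Hypothetical entries for illustration only; substitute current listed rates.
MODELS = [
    ModelPrice("model-a", 0.05, 1.50),
    ModelPrice("model-b", 0.10, 0.40),
    ModelPrice("model-c", 3.00, 15.00),
]


def cheapest_by(models, column):
    """Return the models sorted from cheapest to most expensive by one price column."""
    if column not in ("input_per_1m", "output_per_1m"):
        raise ValueError(f"unknown price column: {column}")
    return sorted(models, key=lambda m: getattr(m, column))


if __name__ == "__main__":
    for model in cheapest_by(MODELS, "output_per_1m"):
        print(f"{model.name}: ${model.output_per_1m:.2f} per 1M output tokens")
```

Sorting by input price and by output price can give different winners, which is why it helps to know which side dominates your workload.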

What's the difference between input and output pricing?

Input pricing is what you pay for tokens sent to the model (your prompt plus any context). Output pricing is what you pay for tokens the model generates (its response). Output tokens are typically 2 to 10 times more expensive than input tokens.
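Because both rates are quoted per 1M tokens, the cost of a single request is each token count times its rate, divided by one million. Here is a minimal sketch of that arithmetic; the request_cost function name and the example prices are assumptions chosen for illustration, not real rates.

```python
def request_cost(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """Estimate the USD cost of one request from token counts and per-1M-token rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * input_per_1m + output_tokens * output_per_1m) / 1_000_000


if __name__ == "__main__":
    # Hypothetical rates: $1.00 per 1M input tokens, $4.00 per 1M output tokens.
    cost = request_cost(input_tokens=2_000, output_tokens=500,
                        input_per_1m=1.00, output_per_1m=4.00)
    print(f"${cost:.4f}")  # prints $0.0040
```

In this example the 500 output tokens cost as much as the 2,000 input tokens, because the assumed output rate is four times the input rate.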

How much can I save by switching models?

Savings depend on your usage pattern. Our users save an average of 40% by switching to the right model for their workload. Use the Savings Calculator to estimate your potential savings based on your own token usage.
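To see how such a comparison works in principle, you can price the same monthly token volume under two models and take the difference. This is a rough sketch under stated assumptions, not how the Savings Calculator is implemented; the function names, usage figures, and prices are all hypothetical.

```python
def monthly_cost(input_tokens, output_tokens, input_per_1m, output_per_1m):
    """USD cost of a month's token volume at the given per-1M-token rates."""
    return (input_tokens * input_per_1m + output_tokens * output_per_1m) / 1_000_000


def savings(current_cost, alternative_cost):
    """Return (absolute USD saved, percent saved); negative values mean the switch costs more."""
    absolute = current_cost - alternative_cost
    percent = absolute * 100 / current_cost if current_cost > 0 else 0.0
    return absolute, percent


if __name__ == "__main__":
    # Hypothetical monthly usage: 50M input tokens, 10M output tokens.
    usage = {"input_tokens": 50_000_000, "output_tokens": 10_000_000}

    # Hypothetical rates for a current model and a cheaper alternative.
    current = monthly_cost(**usage, input_per_1m=3.00, output_per_1m=15.00)
    alternative = monthly_cost(**usage, input_per_1m=0.50, output_per_1m=2.00)

    saved, pct = savings(current, alternative)
    print(f"current ${current:.2f}, alternative ${alternative:.2f}, "
          f"saved ${saved:.2f} ({pct:.1f}%)")
```

The result depends heavily on your input-to-output ratio, so use your real monthly token counts rather than round numbers like these.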

Are there hidden costs beyond per-token pricing?

Some providers charge for image generation, fine-tuning, or dedicated capacity. Most standard text API calls only incur per-token costs. Always check the provider's pricing page for edge cases like batch processing discounts or rate limit tiers.