Side-by-side API pricing comparison: which model gives you more for less?
Last verified May 2026 · Prices per 1M tokens
| Feature | GPT-oss 120B | Gemini 2.5 Flash-Lite |
|---|---|---|
| Provider | OpenAI | |
| Tier | Budget | Budget |
| Input Price | $0.15 | $0.1 |
| Output Price | $0.6 | $0.4 |
| Context Window | 128K | 1M |
| Verified | May 2026 | Jun 2026 |
High-volume APIs, batch processing, and startups watching runway.
Tasks requiring advanced reasoning, code generation, or nuanced analysis.
Real-time chatbots, streaming responses, and latency-sensitive apps.
Development, experimentation, and non-critical workloads.
APIpulse Pro monitors 49 models across 10 providers. Get alerts when GPT-oss 120B or Gemini 2.5 Flash-Lite prices change.
Get Pro for $19 →Yes. Gemini 2.5 Flash-Lite costs $0.1 input / $0.4 output per 1M tokens, while GPT-oss 120B costs $0.15 input / $0.6 output. That's 33% cheaper on input and 33% cheaper on output.
For a typical workload (1M input + 500K output tokens/month), Gemini 2.5 Flash-Lite costs $0.30/month vs $0.45/month for GPT-oss 120B. That's a savings of $0.15/month (33%).
Choose Gemini 2.5 Flash-Lite for cost efficiency. Choose GPT-oss 120B for OpenAI ecosystem benefits. GPT-oss 120B has 128K context vs Gemini 2.5 Flash-Lite's 1M.