Updated 2026-07-04 · 13 models · first-party API prices
Every LLM API price.
One table, tracked daily.
Compare what Claude, GPT, Gemini, Grok, DeepSeek and Mistral actually cost per million tokens — with a change log, so you know when prices moved and by how much. Right now the cheapest blended price is GPT-5 nano.
Current API pricing
| Model | Provider | Input / MTok | Output / MTok | Blended* | Context | Class |
|---|---|---|---|---|---|---|
| GPT-5 nano | OpenAI | $0.05 | $0.40 | $0.14 | 400K | fast |
| Gemini 2.5 Flash-Lite | $0.10 | $0.40 | $0.18 | 1M | fast | |
| Grok 4 Fast | xAI | $0.20 | $0.50 | $0.28 | 2M | fast |
| DeepSeek V3.2 | DeepSeek | $0.28 | $0.42 | $0.32 | 128K | workhorse |
| GPT-5 mini | OpenAI | $0.25 | $2 | $0.69 | 400K | workhorse |
| Gemini 2.5 Flash | $0.30 | $2.50 | $0.85 | 1M | workhorse | |
| Claude Haiku 4.5 | Anthropic | $1 | $5 | $2 | 200K | fast |
| Mistral Medium 3.5 | Mistral | $1.50 | $7.50 | $3 | 128K | workhorse |
| GPT-5.1 | OpenAI | $1.25 | $10 | $3.44 | 400K | frontier |
| Gemini 3 Pro | $2 | $12 | $4.50 | 1M | frontier | |
| Claude Sonnet 4.5 | Anthropic | $3 | $15 | $6 | 200K | workhorse |
| Grok 4 | xAI | $3 | $15 | $6 | 256K | frontier |
| Claude Opus 4.5 | Anthropic | $5 | $25 | $10 | 200K | frontier |
*Blended assumes a 3:1 input-to-output token ratio, typical for chat and RAG workloads. USD per 1M tokens, first-party API, standard tier — no caching or batch discounts.
Latest price movements
-
Mistral replaced Medium 3 with Mistral Medium 3.5 at $1.50/$7.50 per MTok — a 3.75× price increase on the Medium line (Medium 3 was $0.40/$2.00). Verified against mistral.ai/pricing/api.
-
Claude Opus 4.5 launched at $5/$25 per MTok — a 67% cut versus Opus 4.1's $15/$75.
-
DeepSeek V3.2 cut API prices by 50%+ to $0.28/$0.42 per MTok (cache miss).