Back to home
LLM Model Pricing Comparison 2026
Input and output token costs per 1M tokens across 290 models from OpenAI, Anthropic, Google AI, Mistral, DeepSeek, and OpenRouter. Prices sourced from provider billing APIs.
Input tokens
Cost for the text you send to the model (your prompt + context).
Output tokens
Cost for text the model generates. Typically 3–5× more expensive than input.
Why LLMeter?
LLMeter tracks your actual spend across all providers automatically — no manual spreadsheets.
Showing 290 of 290 models
Anthropic(13 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Claude 3 Haiku claude-3-haiku | Budget | $0.250 | $1.25 |
Claude 3.5 Haiku claude-3.5-haiku | Budget | $0.800 | $4.00 |
Claude Haiku 4.5 claude-haiku-4.5 | Standard | $1.00 | $5.00 |
Claude Sonnet 4 claude-sonnet-4 | Standard | $3.00 | $15.00 |
Claude Sonnet 4.5 claude-sonnet-4.5 | Standard | $3.00 | $15.00 |
Claude Sonnet 4.6 claude-sonnet-4.6 | Standard | $3.00 | $15.00 |
Claude Opus 4.5 claude-opus-4.5 | Standard | $5.00 | $25.00 |
Claude Opus 4.6 claude-opus-4.6 | Standard | $5.00 | $25.00 |
Claude Opus 4.7 claude-opus-4.7 | Standard | $5.00 | $25.00 |
Claude Opus 4 claude-opus-4 | Premium | $15.00 | $75.00 |
Claude Opus 4.1 claude-opus-4.1 | Premium | $15.00 | $75.00 |
Claude Opus 4.6 (Fast) claude-opus-4.6-fast | Premium | $30.00 | $150.00 |
Claude Opus 4.7 (Fast) claude-opus-4.7-fast | Premium | $30.00 | $150.00 |
OpenAI(65 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
gpt-oss-120b (free) gpt-oss-120b:free | Budget | $0 | $0 |
gpt-oss-20b (free) gpt-oss-20b:free | Budget | $0 | $0 |
gpt-oss-20b gpt-oss-20b | Budget | $0.030 | $0.140 |
gpt-oss-120b gpt-oss-120b | Budget | $0.039 | $0.180 |
GPT-5 Nano gpt-5-nano | Budget | $0.050 | $0.400 |
gpt-oss-safeguard-20b gpt-oss-safeguard-20b | Budget | $0.075 | $0.300 |
GPT-4.1 Nano gpt-4.1-nano | Budget | $0.100 | $0.400 |
GPT-4o-mini gpt-4o-mini | Budget | $0.150 | $0.600 |
GPT-4o-mini (2024-07-18) gpt-4o-mini-2024-07-18 | Budget | $0.150 | $0.600 |
GPT-4o-mini Search Preview gpt-4o-mini-search-preview | Budget | $0.150 | $0.600 |
GPT-5.4 Nano gpt-5.4-nano | Budget | $0.200 | $1.25 |
GPT-5 Mini gpt-5-mini | Budget | $0.250 | $2.00 |
GPT-5.1-Codex-Mini gpt-5.1-codex-mini | Budget | $0.250 | $2.00 |
GPT-4.1 Mini gpt-4.1-mini | Budget | $0.400 | $1.60 |
GPT-3.5 Turbo gpt-3.5-turbo | Budget | $0.500 | $1.50 |
GPT Audio Mini gpt-audio-mini | Budget | $0.600 | $2.40 |
GPT-5.4 Mini gpt-5.4-mini | Budget | $0.750 | $4.50 |
GPT-3.5 Turbo (older v0613) gpt-3.5-turbo-0613 | Standard | $1.00 | $2.00 |
o3 Mini o3-mini | Standard | $1.10 | $4.40 |
o3 Mini High o3-mini-high | Standard | $1.10 | $4.40 |
o4 Mini o4-mini | Standard | $1.10 | $4.40 |
o4 Mini High o4-mini-high | Standard | $1.10 | $4.40 |
GPT-5 gpt-5 | Standard | $1.25 | $10.00 |
GPT-5 Chat gpt-5-chat | Standard | $1.25 | $10.00 |
GPT-5 Codex gpt-5-codex | Standard | $1.25 | $10.00 |
GPT-5.1 gpt-5.1 | Standard | $1.25 | $10.00 |
GPT-5.1 Chat gpt-5.1-chat | Standard | $1.25 | $10.00 |
GPT-5.1-Codex gpt-5.1-codex | Standard | $1.25 | $10.00 |
GPT-5.1-Codex-Max gpt-5.1-codex-max | Standard | $1.25 | $10.00 |
GPT-3.5 Turbo Instruct gpt-3.5-turbo-instruct | Standard | $1.50 | $2.00 |
GPT-5.2 gpt-5.2 | Standard | $1.75 | $14.00 |
GPT-5.2 Chat gpt-5.2-chat | Standard | $1.75 | $14.00 |
GPT-5.2-Codex gpt-5.2-codex | Standard | $1.75 | $14.00 |
GPT-5.3 Chat gpt-5.3-chat | Standard | $1.75 | $14.00 |
GPT-5.3-Codex gpt-5.3-codex | Standard | $1.75 | $14.00 |
GPT-4.1 gpt-4.1 | Standard | $2.00 | $8.00 |
o3 o3 | Standard | $2.00 | $8.00 |
o4 Mini Deep Research o4-mini-deep-research | Standard | $2.00 | $8.00 |
GPT-4o gpt-4o | Standard | $2.50 | $10.00 |
GPT-4o (2024-08-06) gpt-4o-2024-08-06 | Standard | $2.50 | $10.00 |
GPT-4o (2024-11-20) gpt-4o-2024-11-20 | Standard | $2.50 | $10.00 |
GPT-4o Audio gpt-4o-audio-preview | Standard | $2.50 | $10.00 |
GPT-4o Search Preview gpt-4o-search-preview | Standard | $2.50 | $10.00 |
GPT-5 Image Mini gpt-5-image-mini | Standard | $2.50 | $2.00 |
GPT-5.4 gpt-5.4 | Standard | $2.50 | $15.00 |
GPT Audio gpt-audio | Standard | $2.50 | $10.00 |
GPT-3.5 Turbo 16k gpt-3.5-turbo-16k | Standard | $3.00 | $4.00 |
GPT-4o (2024-05-13) gpt-4o-2024-05-13 | Standard | $5.00 | $15.00 |
GPT-5.5 gpt-5.5 | Standard | $5.00 | $30.00 |
GPT Chat Latest gpt-chat-latest | Standard | $5.00 | $30.00 |
GPT-5.4 Image 2 gpt-5.4-image-2 | Standard | $8.00 | $15.00 |
GPT-4 Turbo (older v1106) gpt-4-1106-preview | Premium | $10.00 | $30.00 |
GPT-4 Turbo gpt-4-turbo | Premium | $10.00 | $30.00 |
GPT-4 Turbo Preview gpt-4-turbo-preview | Premium | $10.00 | $30.00 |
GPT-5 Image gpt-5-image | Premium | $10.00 | $10.00 |
o3 Deep Research o3-deep-research | Premium | $10.00 | $40.00 |
GPT-5 Pro gpt-5-pro | Premium | $15.00 | $120.00 |
o1 o1 | Premium | $15.00 | $60.00 |
o3 Pro o3-pro | Premium | $20.00 | $80.00 |
GPT-5.2 Pro gpt-5.2-pro | Premium | $21.00 | $168.00 |
GPT-4 gpt-4 | Premium | $30.00 | $60.00 |
GPT-4 (older v0314) gpt-4-0314 | Premium | $30.00 | $60.00 |
GPT-5.4 Pro gpt-5.4-pro | Premium | $30.00 | $180.00 |
GPT-5.5 Pro gpt-5.5-pro | Premium | $30.00 | $180.00 |
o1-pro o1-pro | Premium | $150.00 | $600.00 |
DeepSeek(13 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
DeepSeek V4 Flash deepseek-v4-flash | Budget | $0.112 | $0.224 |
DeepSeek V3 0324 deepseek-chat-v3-0324 | Budget | $0.200 | $0.770 |
DeepSeek V3.1 deepseek-chat-v3.1 | Budget | $0.210 | $0.790 |
DeepSeek V3.2 deepseek-v3.2 | Budget | $0.252 | $0.378 |
DeepSeek V3.1 Terminus deepseek-v3.1-terminus | Budget | $0.270 | $0.950 |
DeepSeek V3.2 Exp deepseek-v3.2-exp | Budget | $0.270 | $0.410 |
DeepSeek V3.2 Speciale deepseek-v3.2-speciale | Budget | $0.287 | $0.431 |
R1 Distill Qwen 32B deepseek-r1-distill-qwen-32b | Budget | $0.290 | $0.290 |
DeepSeek V3 deepseek-chat | Budget | $0.320 | $0.890 |
DeepSeek V4 Pro deepseek-v4-pro | Budget | $0.435 | $0.870 |
R1 0528 deepseek-r1-0528 | Budget | $0.500 | $2.15 |
R1 deepseek-r1 | Budget | $0.700 | $2.50 |
R1 Distill Llama 70B deepseek-r1-distill-llama-70b | Budget | $0.700 | $0.800 |
Google AI(27 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Gemma 4 26B A4B (free) gemma-4-26b-a4b-it:free | Budget | $0 | $0 |
Gemma 4 31B (free) gemma-4-31b-it:free | Budget | $0 | $0 |
Lyria 3 Clip Preview lyria-3-clip-preview | Budget | $0 | $0 |
Lyria 3 Pro Preview lyria-3-pro-preview | Budget | $0 | $0 |
Gemma 3 12B gemma-3-12b-it | Budget | $0.040 | $0.130 |
Gemma 3 4B gemma-3-4b-it | Budget | $0.040 | $0.080 |
Gemma 3n 4B gemma-3n-e4b-it | Budget | $0.060 | $0.120 |
Gemma 4 26B A4B gemma-4-26b-a4b-it | Budget | $0.060 | $0.330 |
Gemini 2.0 Flash Lite gemini-2.0-flash-lite-001 | Budget | $0.075 | $0.300 |
Gemma 3 27B gemma-3-27b-it | Budget | $0.080 | $0.160 |
Gemini 2.0 Flash gemini-2.0-flash-001 | Budget | $0.100 | $0.400 |
Gemini 2.5 Flash Lite gemini-2.5-flash-lite | Budget | $0.100 | $0.400 |
Gemini 2.5 Flash Lite Preview 09-2025 gemini-2.5-flash-lite-preview-09-2025 | Budget | $0.100 | $0.400 |
Gemma 4 31B gemma-4-31b-it | Budget | $0.120 | $0.370 |
Gemini 3.1 Flash Lite gemini-3.1-flash-lite | Budget | $0.250 | $1.50 |
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview | Budget | $0.250 | $1.50 |
Gemini 2.5 Flash gemini-2.5-flash | Budget | $0.300 | $2.50 |
Nano Banana (Gemini 2.5 Flash Image) gemini-2.5-flash-image | Budget | $0.300 | $2.50 |
Gemini 3 Flash Preview gemini-3-flash-preview | Budget | $0.500 | $3.00 |
Nano Banana 2 (Gemini 3.1 Flash Image Preview) gemini-3.1-flash-image-preview | Budget | $0.500 | $3.00 |
Gemma 2 27B gemma-2-27b-it | Budget | $0.650 | $0.650 |
Gemini 2.5 Pro gemini-2.5-pro | Standard | $1.25 | $10.00 |
Gemini 2.5 Pro Preview 06-05 gemini-2.5-pro-preview | Standard | $1.25 | $10.00 |
Gemini 2.5 Pro Preview 05-06 gemini-2.5-pro-preview-05-06 | Standard | $1.25 | $10.00 |
Nano Banana Pro (Gemini 3 Pro Image Preview) gemini-3-pro-image-preview | Standard | $2.00 | $12.00 |
Gemini 3.1 Pro Preview gemini-3.1-pro-preview | Standard | $2.00 | $12.00 |
Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools | Standard | $2.00 | $12.00 |
xai(6 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Grok 3 Mini grok-3-mini | Budget | $0.300 | $0.500 |
Grok 3 Mini Fast grok-3-mini-fast | Budget | $0.600 | $4.00 |
Grok 2 grok-2-1212 | Standard | $2.00 | $10.00 |
Grok 2 Vision grok-2-vision-1212 | Standard | $2.00 | $10.00 |
Grok 3 grok-3 | Premium | $3.00 | $15.00 |
Grok 3 Fast grok-3-fast | Standard | $5.00 | $25.00 |
groq(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B Instant llama-3.1-8b-instant | Budget | $0.050 | $0.080 |
Llama 3 8B llama3-8b-8192 | Budget | $0.050 | $0.080 |
Llama 4 Scout meta-llama/llama-4-scout-17b-16e-instruct | Standard | $0.110 | $0.340 |
Llama 3.2 11B Vision llama-3.2-11b-vision-preview | Budget | $0.180 | $0.180 |
Llama 4 Maverick meta-llama/llama-4-maverick-17b-128e-instruct | Premium | $0.200 | $0.600 |
Gemma 2 9B gemma2-9b-it | Budget | $0.200 | $0.200 |
Mixtral 8x7B mixtral-8x7b-32768 | Standard | $0.240 | $0.240 |
Llama 3.3 70B Versatile llama-3.3-70b-versatile | Standard | $0.590 | $0.790 |
Llama 3 70B llama3-70b-8192 | Standard | $0.590 | $0.790 |
Llama 3.2 90B Vision llama-3.2-90b-vision-preview | Premium | $0.900 | $0.900 |
cohere(8 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Embed English v3 embed-english-v3.0 | Budget | $0.100 | $0 |
Embed Multilingual v3 embed-multilingual-v3.0 | Budget | $0.100 | $0 |
Command R command-r | Standard | $0.150 | $0.600 |
Command R (Aug 2024) command-r-08-2024 | Standard | $0.150 | $0.600 |
Command Light command-light | Budget | $0.300 | $0.600 |
Command command | Standard | $1.00 | $2.00 |
Command R+ command-r-plus | Premium | $2.50 | $10.00 |
Command R+ (Aug 2024) command-r-plus-08-2024 | Premium | $2.50 | $10.00 |
together(12 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 4 Scout 17B meta-llama/llama-4-scout-17b-16e-instruct | Standard | $0.180 | $0.590 |
Llama 3.1 8B Turbo meta-llama/meta-llama-3.1-8b-instruct-turbo | Budget | $0.180 | $0.180 |
Mistral 7B Instruct mistralai/mistral-7b-instruct-v0.3 | Budget | $0.200 | $0.200 |
Llama 4 Maverick 17B meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | Standard | $0.270 | $0.850 |
Qwen 2.5 7B Turbo qwen/qwen2.5-7b-instruct-turbo | Budget | $0.300 | $0.300 |
Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct-v0.1 | Standard | $0.540 | $0.540 |
Llama 3.3 70B Turbo meta-llama/llama-3.3-70b-instruct-turbo | Standard | $0.880 | $0.880 |
Llama 3.1 70B Turbo meta-llama/meta-llama-3.1-70b-instruct-turbo | Standard | $0.880 | $0.880 |
Qwen 2.5 72B Turbo qwen/qwen2.5-72b-instruct-turbo | Standard | $1.20 | $1.20 |
DeepSeek V3 deepseek-ai/deepseek-v3 | Standard | $1.25 | $1.25 |
Llama 3.1 405B Turbo meta-llama/meta-llama-3.1-405b-instruct-turbo | Premium | $3.50 | $3.50 |
DeepSeek R1 deepseek-ai/deepseek-r1 | Premium | $7.00 | $7.00 |
fireworks(13 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 4 Scout accounts/fireworks/models/llama4-scout-instruct-basic | Budget | $0.150 | $0.600 |
Llama 3.1 8B Instruct accounts/fireworks/models/llama-v3p1-8b-instruct | Budget | $0.200 | $0.200 |
Gemma 2 9B IT accounts/fireworks/models/gemma2-9b-it | Budget | $0.200 | $0.200 |
Llama 3 8B Instruct accounts/fireworks/models/llama-v3-8b-instruct | Budget | $0.200 | $0.200 |
Llama 4 Maverick accounts/fireworks/models/llama4-maverick-instruct-basic | Standard | $0.220 | $0.880 |
Mixtral 8x7B Instruct accounts/fireworks/models/mixtral-8x7b-instruct | Standard | $0.500 | $0.500 |
Llama 3.3 70B Instruct accounts/fireworks/models/llama-v3p3-70b-instruct | Standard | $0.900 | $0.900 |
Llama 3.1 70B Instruct accounts/fireworks/models/llama-v3p1-70b-instruct | Standard | $0.900 | $0.900 |
DeepSeek V3 accounts/fireworks/models/deepseek-v3 | Standard | $0.900 | $0.900 |
Qwen 2.5 72B Instruct accounts/fireworks/models/qwen2p5-72b-instruct | Standard | $0.900 | $0.900 |
Llama 3 70B Instruct accounts/fireworks/models/llama-v3-70b-instruct | Standard | $0.900 | $0.900 |
Llama 3.1 405B Instruct accounts/fireworks/models/llama-v3p1-405b-instruct | Premium | $3.00 | $3.00 |
DeepSeek R1 accounts/fireworks/models/deepseek-r1 | Premium | $8.00 | $8.00 |
perplexity(6 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Sonar sonar | Budget | $1.00 | $1.00 |
Sonar Reasoning sonar-reasoning | Standard | $1.00 | $5.00 |
Sonar Reasoning Pro sonar-reasoning-pro | Premium | $2.00 | $8.00 |
Sonar Deep Research sonar-deep-research | Premium | $2.00 | $8.00 |
R1 1776 r1-1776 | Standard | $2.00 | $8.00 |
Sonar Pro sonar-pro | Premium | $3.00 | $15.00 |
cerebras(5 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B llama3.1-8b | Budget | $0.100 | $0.100 |
Qwen 3 32B qwen-3-32b | Standard | $0.400 | $0.400 |
Llama 3.1 70B llama3.1-70b | Standard | $0.600 | $0.600 |
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | Standard | $0.600 | $0.600 |
Llama 3.3 70B llama-3.3-70b | Standard | $0.850 | $0.850 |
ai21(4 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Jamba 1.5 Mini jamba-1.5-mini | Budget | $0.200 | $0.400 |
Jamba 1.6 Mini jamba-1.6-mini | Budget | $0.200 | $0.400 |
Jamba 1.5 Large jamba-1.5-large | Premium | $2.00 | $8.00 |
Jamba 1.6 Large jamba-1.6-large | Premium | $2.00 | $8.00 |
deepinfra(12 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B meta-llama/Meta-Llama-3.1-8B-Instruct | Budget | $0.030 | $0.050 |
Llama 4 Scout 17B meta-llama/Llama-4-Scout-17B-16E-Instruct | Budget | $0.070 | $0.110 |
Phi-4 microsoft/Phi-4 | Budget | $0.070 | $0.140 |
QwQ 32B Qwen/QwQ-32B | Standard | $0.120 | $0.180 |
Llama 3.1 Nemotron 70B nvidia/Llama-3.1-Nemotron-70B-Instruct | Standard | $0.120 | $0.300 |
Llama 4 Maverick 17B meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | Standard | $0.180 | $0.540 |
Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct | Standard | $0.220 | $0.590 |
Mixtral 8x7B mistralai/Mixtral-8x7B-Instruct-v0.1 | Standard | $0.240 | $0.240 |
Llama 3.1 70B meta-llama/Meta-Llama-3.1-70B-Instruct | Standard | $0.350 | $0.390 |
Qwen 2.5 72B Qwen/Qwen2.5-72B-Instruct | Standard | $0.350 | $0.390 |
DeepSeek V3 deepseek-ai/DeepSeek-V3 | Standard | $0.420 | $0.850 |
DeepSeek R1 deepseek-ai/DeepSeek-R1 | Premium | $0.550 | $2.19 |
novita(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Mistral 7B mistralai/mistral-7b-instruct-v0.3 | Budget | $0.030 | $0.030 |
Llama 3.1 8B meta-llama/llama-3.1-8b-instruct | Budget | $0.060 | $0.060 |
Qwen 2.5 7B Qwen/Qwen2.5-7B-Instruct | Budget | $0.070 | $0.070 |
QwQ 32B Qwen/QwQ-32B | Standard | $0.120 | $0.180 |
Gemma 2 27B google/gemma-2-27b-it | Standard | $0.200 | $0.200 |
Qwen 2.5 72B Qwen/Qwen2.5-72B-Instruct | Standard | $0.350 | $0.350 |
DeepSeek V3 deepseek/deepseek_v3_0324 | Standard | $0.380 | $0.380 |
Llama 3.1 70B meta-llama/llama-3.1-70b-instruct | Standard | $0.400 | $0.400 |
Llama 3.3 70B meta-llama/llama-3.3-70b-instruct | Standard | $0.400 | $0.400 |
DeepSeek R1 deepseek/deepseek_r1 | Premium | $0.550 | $2.19 |
hyperbolic(9 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B meta-llama/Meta-Llama-3.1-8B-Instruct | Budget | $0.080 | $0.080 |
Llama 4 Scout meta-llama/Llama-4-Scout-17B-16E-Instruct | Budget | $0.100 | $0.300 |
Mistral 7B mistralai/Mistral-7B-Instruct-v0.3 | Budget | $0.110 | $0.110 |
Llama 3.3 70B meta-llama/Meta-Llama-3.3-70B-Instruct | Standard | $0.400 | $0.400 |
Llama 3.1 70B meta-llama/Meta-Llama-3.1-70B-Instruct | Standard | $0.400 | $0.400 |
DeepSeek V3 deepseek-ai/DeepSeek-V3 | Standard | $0.400 | $0.400 |
Qwen 2.5 72B Qwen/Qwen2.5-72B-Instruct | Standard | $0.400 | $0.400 |
Llama 4 Maverick meta-llama/Llama-4-Maverick-17B-128E-Instruct | Standard | $0.500 | $1.50 |
DeepSeek R1 deepseek-ai/DeepSeek-R1 | Premium | $0.500 | $2.18 |
sambanova(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.2 1B Meta-Llama-3.2-1B-Instruct | Budget | $0.040 | $0.040 |
Llama 3.2 3B Meta-Llama-3.2-3B-Instruct | Budget | $0.080 | $0.080 |
Llama 3.1 8B Meta-Llama-3.1-8B-Instruct | Budget | $0.100 | $0.100 |
Qwen 2.5 Coder 32B Qwen2.5-Coder-32B-Instruct | Standard | $0.400 | $0.800 |
Llama 3.1 70B Meta-Llama-3.1-70B-Instruct | Standard | $0.600 | $1.20 |
Llama 3.3 70B Meta-Llama-3.3-70B-Instruct | Standard | $0.600 | $1.20 |
Qwen 2.5 72B Qwen2.5-72B-Instruct | Standard | $0.600 | $1.20 |
DeepSeek V3 DeepSeek-V3-0324 | Standard | $0.700 | $1.40 |
Llama 3.1 405B Meta-Llama-3.1-405B-Instruct | Premium | $2.00 | $2.00 |
DeepSeek R1 DeepSeek-R1 | Premium | $3.00 | $10.00 |
lambdalabs(9 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B meta-llama/Llama-3.1-8B-Instruct | Budget | $0.018 | $0.018 |
Hermes 3 8B hermes3-8b | Budget | $0.018 | $0.018 |
Qwen 2.5 Coder 32B Qwen/Qwen2.5-Coder-32B-Instruct | Standard | $0.040 | $0.040 |
Liquid LFM 40B MoE lfm-40b | Standard | $0.040 | $0.040 |
Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct-FP8 | Standard | $0.060 | $0.090 |
Llama 3.1 70B meta-llama/Llama-3.1-70B-Instruct-FP8 | Standard | $0.060 | $0.090 |
Hermes 3 70B hermes3-70b | Standard | $0.060 | $0.090 |
Llama 3.1 405B meta-llama/Llama-3.1-405B-Instruct-FP8 | Premium | $0.530 | $0.530 |
Hermes 3 405B hermes3-405b | Premium | $0.530 | $0.530 |
inferencenet(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B meta-llama/llama-3.1-8b-instruct/fp-8 | Budget | $0.040 | $0.040 |
Mistral 7B mistralai/mistral-7b-instruct/fp-8 | Budget | $0.040 | $0.040 |
Gemma 2 9B google/gemma-2-9b-it/fp-8 | Budget | $0.050 | $0.050 |
Phi 4 microsoft/phi-4/fp-8 | Budget | $0.080 | $0.080 |
Mixtral 8x7B mistralai/mixtral-8x7b-instruct/fp-8 | Standard | $0.120 | $0.120 |
Llama 3.3 70B meta-llama/llama-3.3-70b-instruct/fp-8 | Standard | $0.200 | $0.200 |
Llama 3.1 70B meta-llama/llama-3.1-70b-instruct/fp-8 | Standard | $0.200 | $0.200 |
Qwen 2.5 72B qwen/qwen2.5-72b-instruct/fp-8 | Standard | $0.200 | $0.200 |
DeepSeek V3 deepseek/deepseek-v3/fp-8 | Standard | $0.250 | $0.250 |
DeepSeek R1 deepseek/deepseek-r1/fp-8 | Premium | $0.800 | $0.800 |
lepton(8 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3 8B llama3-8b | Budget | $0.060 | $0.060 |
Mistral 7B mistral-7b | Budget | $0.060 | $0.060 |
Llama 3.1 8B llama3-1-8b | Budget | $0.070 | $0.070 |
Mixtral 8x7B mixtral-8x7b | Standard | $0.300 | $0.300 |
Qwen 2.5 72B qwen2-5-72b | Standard | $0.600 | $0.600 |
Llama 3 70B llama3-70b | Standard | $0.700 | $0.700 |
Llama 3.1 70B llama3-1-70b | Standard | $0.800 | $0.800 |
Llama 3.1 405B llama3-1-405b | Premium | $2.80 | $2.80 |
nvidia(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B Instruct meta/llama-3.1-8b-instruct | Budget | $0.050 | $0.050 |
Mistral 7B Instruct mistralai/mistral-7b-instruct-v0.3 | Budget | $0.080 | $0.080 |
Gemma 2 9B IT google/gemma-2-9b-it | Budget | $0.090 | $0.090 |
Llama 3.3 70B Instruct meta/llama-3.3-70b-instruct | Standard | $0.230 | $0.230 |
Phi 3 Medium 128K microsoft/phi-3-medium-128k-instruct | Budget | $0.250 | $0.250 |
Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct-v0.1 | Standard | $0.300 | $0.300 |
Llama 3.1 70B Instruct meta/llama-3.1-70b-instruct | Standard | $0.350 | $0.400 |
DeepSeek R1 deepseek-ai/deepseek-r1 | Standard | $0.800 | $2.40 |
Llama 3.1 405B Instruct meta/llama-3.1-405b-instruct | Premium | $2.99 | $2.99 |
Nemotron 4 340B Instruct nvidia/nemotron-4-340b-instruct | Premium | $4.20 | $4.20 |
cloudflare(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.2 1B Instruct @cf/meta/llama-3.2-1b-instruct | Budget | $0.060 | $0.060 |
Llama 3.2 3B Instruct @cf/meta/llama-3.2-3b-instruct | Budget | $0.080 | $0.080 |
Gemma 2B IT @cf/google/gemma-2b-it | Budget | $0.080 | $0.080 |
Phi-2 @cf/microsoft/phi-2 | Budget | $0.080 | $0.080 |
Llama 3.1 8B Instruct (Fast) @cf/meta/llama-3.1-8b-instruct-fast | Budget | $0.100 | $0.100 |
Mistral 7B Instruct v0.1 @cf/mistral/mistral-7b-instruct-v0.1 | Budget | $0.110 | $0.110 |
Gemma 7B IT @cf/google/gemma-7b-it | Budget | $0.110 | $0.110 |
Llama 3.2 11B Vision Instruct @cf/meta/llama-3.2-11b-vision-instruct | Standard | $0.140 | $0.140 |
Qwen 1.5 14B Chat (AWQ) @cf/qwen/qwen1.5-14b-chat-awq | Standard | $0.180 | $0.180 |
Llama 3.3 70B Instruct (Fast) @cf/meta/llama-3.3-70b-instruct-fp8-fast | Standard | $0.560 | $0.560 |
nebius(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Mistral Nemo mistralai/Mistral-Nemo-Instruct-2407 | Budget | $0.040 | $0.040 |
Phi-3 Mini (128k) microsoft/Phi-3-mini-128k-instruct | Budget | $0.040 | $0.040 |
Gemma 2 9B google/gemma-2-9b-it | Budget | $0.040 | $0.040 |
Llama 3.1 8B Instruct meta-llama/Llama-3.1-8B-Instruct | Budget | $0.060 | $0.060 |
Qwen 2.5 7B Instruct Qwen/Qwen2.5-7B-Instruct | Budget | $0.060 | $0.060 |
Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct | Standard | $0.130 | $0.400 |
Llama 3.1 70B Instruct meta-llama/Llama-3.1-70B-Instruct | Standard | $0.130 | $0.400 |
Qwen 2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct | Standard | $0.130 | $0.400 |
DeepSeek V3 deepseek-ai/DeepSeek-V3 | Standard | $0.280 | $1.10 |
DeepSeek R1 deepseek-ai/DeepSeek-R1 | Premium | $0.550 | $2.19 |
replicate(10 models)
| Model | Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|---|
Llama 3.1 8B meta/llama-3.1-8b-instruct | Budget | $0.050 | $0.050 |
Llama 3.2 11B Vision meta/llama-3.2-11b-vision-instruct | Budget | $0.055 | $0.055 |
Gemma 2 9B google-deepmind/gemma-2-9b-it | Budget | $0.060 | $0.060 |
DeepSeek V3 deepseek-ai/deepseek-v3 | Standard | $0.270 | $1.10 |
Mixtral 8x7B mistralai/mixtral-8x7b-instruct-v0.1 | Standard | $0.300 | $0.300 |
Qwen 2.5 72B qwen/qwen2.5-72b-instruct | Standard | $0.350 | $0.400 |
Llama 3.1 70B meta/llama-3.1-70b-instruct | Standard | $0.650 | $0.650 |
Llama 3.3 70B meta/llama-3.3-70b-instruct | Standard | $0.900 | $0.900 |
DeepSeek R1 deepseek-ai/deepseek-r1 | Premium | $3.00 | $8.00 |
Llama 3.1 405B meta/llama-3.1-405b-instruct | Premium | $9.50 | $9.50 |