Claude Opus 4.6
$11.000/M
Claude Opus 4.5
$33.000/M
Claude Sonnet 3.7
$6.600/M
Claude Opus 3
$33.000/M
Claude 2.1
$12.800/M
Claude 2
$12.800/M
GPT-5
$3.875/M
GPT-4.5
$97.500/M
GPT-4 Turbo Preview
$16.000/M
GPT-4
$39.000/M
GPT-4-32k
$78.000/M
o3
$19.000/M
o3-mini
$2.090/M
o4-mini
$2.090/M
o1
$28.500/M
o1-mini
$5.700/M
o1-preview
$28.500/M
Gemini 2.5 Pro
$3.875/M
Gemini 1.5 Pro
$2.375/M
Gemini 1.0 Ultra
$12.000/M
Gemini 1.0 Pro
$0.800/M
PaLM 2 Bison
$0.500/M
PaLM 2 Unicorn
$5.000/M
Gemma 3 27B
$0.270/M
Grok 3
$6.600/M
Grok 2
$4.400/M
Grok 1.5
$8.000/M
DeepSeek-V3
$0.519/M
DeepSeek-V3-0324
$0.519/M
DeepSeek-R1
$1.042/M
Claude Opus 4.6
$11.000/M
Claude Opus 4.5
$33.000/M
Claude Sonnet 3.7
$6.600/M
Claude Opus 3
$33.000/M
Claude 2.1
$12.800/M
Claude 2
$12.800/M
GPT-5
$3.875/M
GPT-4.5
$97.500/M
GPT-4 Turbo Preview
$16.000/M
GPT-4
$39.000/M
GPT-4-32k
$78.000/M
o3
$19.000/M
o3-mini
$2.090/M
o4-mini
$2.090/M
o1
$28.500/M
o1-mini
$5.700/M
o1-preview
$28.500/M
Gemini 2.5 Pro
$3.875/M
Gemini 1.5 Pro
$2.375/M
Gemini 1.0 Ultra
$12.000/M
Gemini 1.0 Pro
$0.800/M
PaLM 2 Bison
$0.500/M
PaLM 2 Unicorn
$5.000/M
Gemma 3 27B
$0.270/M
Grok 3
$6.600/M
Grok 2
$4.400/M
Grok 1.5
$8.000/M
DeepSeek-V3
$0.519/M
DeepSeek-V3-0324
$0.519/M
DeepSeek-R1
$1.042/M
BETA
Home
Feed
Insights
Index
Context
About
Subscribe
← All providers
·
AI model pricing index
Lepton AI
Cloud-native AI runtime with serverless model inference and dedicated deployments.
Founded
2023
HQ
Sunnyvale, USA
Website ↗
API docs ↗
MODELS TRACKED
3
1 category
FLAGSHIP
Llama 3.1 70B (Lepton)
efficient
MIN INPUT
$0.070/M
cheapest model in family
AVG BLENDED
$0.457/M
across 3 priced models
MAX CONTEXT
128K
largest window in family
Efficient
3 models
Llama 3.1 70B (Lepton)
profile
efficient · 128K ctx
in
$0.800/M
out
$0.800/M
Lepton inference
Mistral 7B (Lepton)
profile
efficient · 32K ctx
in
$0.070/M
out
$0.070/M
Cheapest at Lepton
Mixtral 8x7B (Lepton)
profile
efficient · 32K ctx
in
$0.500/M
out
$0.500/M
MoE on Lepton