Claude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/MClaude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/M

Fireworks AI

Fireworks AI is an AI model provider.Tokenando tracks 9 Fireworks AI models, with input pricing from $0.100/M and an average blended cost of $1.244/M. Its flagship model is Qwen2.5-72B (fw).

Serverless open-model inference focused on low latency. Function calling, JSON mode and LoRA hosting.

Founded 2022HQ Redwood City, USAWebsite ↗Official pricing ↗API docs ↗

MODELS TRACKED

3 categories

FLAGSHIP

Qwen2.5-72B (fw)

Live API

MIN INPUT

$0.100/M

cheapest model in family

AVG BLENDED

$1.244/M

across 9 priced models

MAX CONTEXT

131K

largest window in family

Frontier

1 model

Llama 3.1 405B (fw)profile

Live API · 131K ctx

in $3.000/Mout $3.000/M

405B serverless · manual-seed

Reasoning

1 model

DeepSeek-R1 (fw)profile

Live API · 64K ctx

in $3.000/Mout $8.000/M

R1 serverless · manual-seed

Efficient

7 models

Mixtral 8x7B (fw)profile

Live API · 32K ctx

in $0.200/Mout $0.200/M

MoE serverless · manual-seed

Qwen2.5-72B (fw)profile

Live API · 32K ctx

in $0.900/Mout $0.900/M

Qwen serverless · manual-seed

Llama 3.1 8B (fw)profile

Live API · 131K ctx

in $0.100/Mout $0.100/M

Cheapest 8B · manual-seed

Yi-34B (fw)profile

Live API · 4K ctx

in $0.900/Mout $0.900/M

Yi on Fireworks · manual-seed

Phind-CodeLlama-34B (fw)profile

Live API · 16K ctx

in $0.800/Mout $0.800/M

Code specialist · manual-seed

Inception: Mercury 2

text->text · 128K ctx

in $0.250/Mout $0.750/M

tokenizer: Other · cron:openrouter

Mercury 2

text->text · 128K ctx

in $0.250/Mout $0.750/M

Frequently Asked Questions

How many models does Fireworks AI offer?

Tokenando tracks 9 Fireworks AI models.

How much do Fireworks AI models cost?

Fireworks AI model input pricing starts at $0.100 per million tokens, with an average blended cost of $1.244 per million across the 9 priced models we track.

What is Fireworks AI's flagship model?

Fireworks AI's flagship model is Qwen2.5-72B (fw). It's the highest-tier Fireworks AI model we track, with input pricing of $0.900 per million tokens.

What model categories does Fireworks AI cover?

Fireworks AI covers 3 categories: frontier, reasoning and efficient.