Claude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/MClaude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/M
BETA
Google
Google
EfficientAUTO PROFILE

Gemini 3 Flash

Efficient

Gemini 3 Flash is a compact variant — chosen for high throughput and low cost rather than absolute capability. Available via Google. 1M-token context. Mid-tier pricing.

Gemini 3 Flash is a efficient AI model from Google. It costs $0.500 per million input tokens and $3.000 per million output tokens (blended $1.250/M), with a 1M-token context window.

INPUT
$0.500/M
per million input tokens
OUTPUT
$3.000/M
per million output tokens
CONTEXT
1M
1,000,000 tokens
What it's good at
  • Sub-second latency
  • Affordable at scale
  • Good for routing & classification
Typical use cases
  • Routing & classification
  • Bulk summarisation
  • Realtime chat UX
Benchmarks
No published benchmark scores tracked for this model yet. Frontier and reasoning models from major providers have scores; smaller models and inference-host variants typically inherit the underlying open-weights score.
More from Google
See all 23
Frequently asked questions

How much does Gemini 3 Flash cost?

Gemini 3 Flash costs $0.500 per million input tokens and $3.000 per million output tokens, for a blended reference rate of $1.250 per million tokens.

What is Gemini 3 Flash's context window?

Gemini 3 Flash supports up to 1M tokens of context (1,000,000 tokens).

What is Gemini 3 Flash best for?

Gemini 3 Flash is well suited to Sub-second latency, Affordable at scale and Good for routing & classification.

Who makes Gemini 3 Flash?

Gemini 3 Flash is developed and served by Google.