Claude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/MClaude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/M

GroqEfficient

Llama 3.2 3B (Groq)

Name: Llama 3.2 3B (Groq)
Brand: Groq
Price: 0.060000 USD

Fast Micro

Edge-targeted Llama 3.2 — designed for on-device assistants on phones and laptops.

Llama 3.2 3B (Groq) is a efficient AI model from Groq. It costs $0.060 per million input tokens and $0.060 per million output tokens (blended $0.060/M), with a 128K-token context window.

Profile inherited from upstream Llama 3.2 3B ↗ — this is a hosted variant of the same open-weights model.

Released Sep 2024Modalities textOfficial model page ↗Provider pricing ↗API docs ↗Compare with another model →Estimate monthly cost →

INPUT

$0.060/M

per million input tokens

OUTPUT

$0.060/M

per million output tokens

BLENDED 70/30

$0.060/M

default reference rate · how it's calculated →

CONTEXT

128K

128,000 tokens

What it's good at

Runs on-device
Open weights
128K context

Typical use cases

Mobile/edge assistants
On-device summarisation

Benchmarks

vs. best public score

Scores inherited from Llama 3.2 3B — this is a hosted variant of the same open-weights model, so the underlying benchmark scores are identical.

MMLU63%

Multitask academic knowledge across 57 subjects.

GPQA Diamond27%

Graduate-level science questions, "Google-proof".

MATH48%

High-school competition math problems.

HumanEval50%

Python function synthesis from docstrings.

Hand-curated from each provider's published reports and public leaderboards. Methodology varies across sources — treat as directional rather than authoritative.

How much does Llama 3.2 3B (Groq) cost?

Llama 3.2 3B (Groq) costs $0.060 per million input tokens and $0.060 per million output tokens, for a blended reference rate of $0.060 per million tokens.

What is Llama 3.2 3B (Groq)'s context window?

Llama 3.2 3B (Groq) supports up to 128K tokens of context (128,000 tokens).

What is Llama 3.2 3B (Groq) best for?

Llama 3.2 3B (Groq) is well suited to Runs on-device, Open weights and 128K context.

Who makes Llama 3.2 3B (Groq)?

Llama 3.2 3B (Groq) is developed and served by Groq. It was released in Sep 2024.