Claude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/MClaude Fable 5$22.000/MClaude Opus 4.8$11.000/MClaude Opus 4.7$11.000/MClaude Opus 4.6$11.000/MClaude Opus 4.5$33.000/MClaude Sonnet 3.7$6.600/MClaude Opus 3$33.000/MClaude 2.1$12.800/MClaude 2$12.800/MGPT-5.5$12.500/MGPT-5.2$5.425/MGPT-5.2-Codex$5.425/MGPT-5$3.875/MGPT-4.5$97.500/MGPT-4 Turbo Preview$16.000/MGPT-4$39.000/MGPT-4-32k$78.000/Mo3$19.000/Mo3-mini$2.090/Mo4-mini$2.090/Mo1$28.500/Mo1-mini$5.700/Mo1-preview$28.500/MGemini 3.5 Pro$5.000/MGemini 3.1 Pro$5.000/MGemini 3 Pro$5.000/MGemini 2.5 Pro$3.875/MGemini 1.5 Pro$2.375/MGemini 1.0 Ultra$12.000/MGemini 1.0 Pro$0.800/M

MetaEfficientLIVE INDEX

Llama 3.2 1B Instruct

Name: Llama 3.2 1B Instruct
Brand: Meta
Price: 0.027000 USD

text->text

Smallest Llama — 1B parameters for ultra-edge inference. Tradeoff: very limited reasoning.

Llama 3.2 1B Instruct is a efficient AI model from Meta. It costs $0.027 per million input tokens and $0.201 per million output tokens (blended $0.079/M), with a 131K-token context window.

Profile inherited from upstream Llama 3.2 1B ↗ — this is a hosted variant of the same open-weights model.

Released Sep 2024Modalities textOfficial model page ↗API docs ↗Pricing source ↗Compare with another model →Estimate monthly cost →

INPUT

$0.027/M

per million input tokens

OUTPUT

$0.201/M

per million output tokens

BLENDED 70/30

$0.079/M

default reference rate · how it's calculated →

CONTEXT

131K

131,072 tokens

What it's good at

Smallest Llama
Edge / mobile
Open weights

Typical use cases

On-device routing
Tiny-footprint chat

Benchmarks

vs. best public score

Scores inherited from Llama 3.2 1B — this is a hosted variant of the same open-weights model, so the underlying benchmark scores are identical.

MMLU49%

Multitask academic knowledge across 57 subjects.

GPQA Diamond25%

Graduate-level science questions, "Google-proof".

MATH30%

High-school competition math problems.

HumanEval38%

Python function synthesis from docstrings.

Hand-curated from each provider's published reports and public leaderboards. Methodology varies across sources — treat as directional rather than authoritative.

How much does Llama 3.2 1B Instruct cost?

Llama 3.2 1B Instruct costs $0.027 per million input tokens and $0.201 per million output tokens, for a blended reference rate of $0.079 per million tokens.

What is Llama 3.2 1B Instruct's context window?

Llama 3.2 1B Instruct supports up to 131K tokens of context (131,072 tokens).

What is Llama 3.2 1B Instruct best for?

Llama 3.2 1B Instruct is well suited to Smallest Llama, Edge / mobile and Open weights.

Who makes Llama 3.2 1B Instruct?

Llama 3.2 1B Instruct is developed and served by Meta. It was released in Sep 2024.