The complete Meta Llama model lineup: 17 open-source models spanning from compact 1B-parameter variants to the flagship Llama 4 Maverick. Meta's Llama family is the most widely adopted open-source LLM ecosystem, offering free weights for self-hosting, fine-tuning, and commercial use. Scores updated hourly from live API data.
All 17 Meta models are listed below with API pricing via OpenRouter, sorted by output price. Self-hosted Llama models are free to run on your own hardware.
| Model | Input ($/1M tokens) | Output ($/1M tokens) |
|---|---|---|
| Llama 3.3 70B Instruct (free) | Free | Free |
| Llama 3.2 3B Instruct (free) | Free | Free |
| Llama 3 8B Instruct | $0.030 | $0.040 |
| Llama 3.2 11B Vision Instruct | $0.049 | $0.049 |
| Llama 3.1 8B Instruct | $0.020 | $0.050 |
| Llama Guard 3 8B | $0.020 | $0.060 |
| Llama Guard 4 12B | $0.180 | $0.180 |
| Llama 3.2 1B Instruct | $0.027 | $0.200 |
| Llama Guard 2 8B | $0.200 | $0.200 |
| Llama 4 Scout | $0.080 | $0.300 |
| Llama 3.3 70B Instruct | $0.100 | $0.320 |
| Llama 3.2 3B Instruct | $0.051 | $0.340 |
| Llama 3.1 70B Instruct | $0.400 | $0.400 |
| Llama 4 Maverick | $0.150 | $0.600 |
| Llama 3 70B Instruct | $0.510 | $0.740 |
| Llama 3.1 405B Instruct | $4.00 | $4.00 |
| Llama 3.1 405B (base) | $4.00 | $4.00 |
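Per-request cost from the table above is simple arithmetic: tokens divided by one million, times the per-million price, for input and output separately. A minimal sketch (prices copied from the table; the token counts are hypothetical):

```python
# Estimate API cost from per-million-token prices (values from the table above).
PRICES = {
    "llama-3.1-8b-instruct":   (0.020, 0.050),  # (input $/1M, output $/1M)
    "llama-4-maverick":        (0.150, 0.600),
    "llama-3.1-405b-instruct": (4.00, 4.00),
}

def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the dollar cost of one request."""
    in_price, out_price = PRICES[model]
    return input_tokens / 1e6 * in_price + output_tokens / 1e6 * out_price

# Hypothetical workload: a 2,000-token prompt with a 500-token completion.
cost = request_cost("llama-4-maverick", 2000, 500)
print(f"${cost:.6f}")  # 2000/1e6 * 0.15 + 500/1e6 * 0.60 = $0.000600
```

The same workload on Llama 3.1 405B would cost about $0.01 per request, which is why the smaller models dominate high-volume use cases.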
Meta releases Llama model weights under the Llama Community License, which allows free download, modification, and commercial deployment (with additional terms applying only to services above a very large user threshold). This open approach has made Llama the most widely adopted open-source LLM family, powering thousands of applications, research projects, and fine-tuned variants. When accessed via API providers like OpenRouter, per-token pricing applies to cover inference infrastructure costs.
The Llama family spans a wide range of parameter counts to fit different hardware and performance needs. Smaller variants (1B, 3B, 8B) run efficiently on consumer GPUs and edge devices. Mid-range models (70B) offer strong general-purpose performance on server hardware. The largest models like Llama 4 Maverick push the frontier of open-source quality, competing with proprietary models on reasoning and coding benchmarks.
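A rough way to map parameter counts to hardware: loaded weights need about (parameters × bytes per parameter), plus headroom for activations and KV cache. A back-of-the-envelope sketch (the 1.2× overhead factor is an assumption for illustration, not a Meta figure):

```python
def weight_memory_gb(params_billions: float, bytes_per_param: float,
                     overhead: float = 1.2) -> float:
    """Approximate memory (GB) to serve a model: weights plus a rough
    overhead factor for activations/KV cache (the 1.2x is an assumption)."""
    return params_billions * bytes_per_param * overhead

for name, b in [("Llama 3.2 1B", 1), ("Llama 3.1 8B", 8), ("Llama 3.3 70B", 70)]:
    fp16 = weight_memory_gb(b, 2.0)  # 16-bit weights
    q4 = weight_memory_gb(b, 0.5)    # 4-bit quantized (e.g. GGUF via llama.cpp)
    print(f"{name}: ~{fp16:.0f} GB fp16, ~{q4:.0f} GB 4-bit")
```

This is why an 8B model at 4-bit quantization (~5 GB) fits a consumer GPU, while 70B at fp16 (~170 GB) needs multi-GPU server hardware.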
Llama models can be self-hosted using tools like Ollama (one-command local deployment), vLLM (high-throughput serving), or llama.cpp (CPU and quantized inference). Self-hosting eliminates per-token costs and keeps data on-premises, making Llama a popular choice for privacy-sensitive workloads and enterprise deployments.
Because the weights are open, Llama models are the most popular base for fine-tuning. Techniques like LoRA and QLoRA allow efficient adaptation to specific domains (legal, medical, code) on a single GPU. The ecosystem includes tools like Hugging Face Transformers, Axolotl, and Unsloth for streamlined training. Many top open-source models on the leaderboard are Llama-based fine-tunes.
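The reason LoRA fits on a single GPU is arithmetic: instead of updating a full d×k weight matrix, it trains two low-rank factors of shape d×r and r×k and adds their product to the frozen weight. A sketch of the parameter-count savings (the 4096 hidden size is illustrative, loosely in the range of an 8B-class projection layer; rank r=16 is a common choice, not a prescribed value):

```python
def lora_trainable_params(d: int, k: int, r: int) -> int:
    """Trainable parameters when a d×k weight W is adapted as W + B @ A,
    with low-rank factors B (d×r) and A (r×k)."""
    return d * r + r * k

d = k = 4096                # illustrative hidden size
full = d * k                # full fine-tuning updates every weight
lora = lora_trainable_params(d, k, r=16)
print(f"full: {full:,}  lora: {lora:,}  ratio: {full // lora}x")
# full: 16,777,216  lora: 131,072  ratio: 128x
```

At rank 16 the adapter trains roughly 1/128th of the weights per adapted matrix; QLoRA goes further by also storing the frozen base weights in 4-bit precision.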
Explore Llama comparisons, open-source rankings, and pricing across the full model landscape.