Large Context Window AI Models

Compare 275 AI models with context windows of 32K tokens or more. The largest models support 2M tokens -- enough to process entire codebases, books, or hundreds of documents in a single prompt. Data updated hourly from OpenRouter.

1M+ Tokens

200K+ Tokens

115

128K Tokens

32K–64K Tokens

Largest Context

275

Models (32K+)

277K

Average Context

1M+ Tokens

(35 models)

The largest context windows available. These models can process entire codebases, full books, or hundreds of documents in a single request.

#	Model	Provider	Context Window	Pages	Input / 1M	Output / 1M
1	Grok 4 Fast	xAI	2M	~3K	$0.20	$0.50
2	Grok 4.1 Fast	xAI	2M	~3K	$0.20	$0.50
3	Gemini 2.0 Flash	Google	1.0M	~1K	$0.10	$0.40
4	Gemini 2.0 Flash Lite	Google	1.0M	~1K	$0.07	$0.30
5	Gemini 2.5 Flash	Google	1.0M	~1K	$0.30	$2.50
6	Gemini 2.5 Flash Lite	Google	1.0M	~1K	$0.10	$0.40
7	Gemini 2.5 Flash Lite Preview 09-2025	Google	1.0M	~1K	$0.10	$0.40
8	Gemini 2.5 Pro	Google	1.0M	~1K	$1.25	$10.00
9	Gemini 2.5 Pro Preview 05-06	Google	1.0M	~1K	$1.25	$10.00
10	Gemini 2.5 Pro Preview 06-05	Google	1.0M	~1K	$1.25	$10.00
11	Gemini 3 Flash Preview	Google	1.0M	~1K	$0.50	$3.00
12	Gemini 3 Pro Preview	Google	1.0M	~1K	$2.00	$12.00
13	Gemini 3.1 Flash Lite Preview	Google	1.0M	~1K	$0.25	$1.50
14	Gemini 3.1 Pro Preview	Google	1.0M	~1K	$2.00	$12.00
15	Gemini 3.1 Pro Preview Custom Tools	Google	1.0M	~1K	$2.00	$12.00
16	Llama 4 Maverick	Meta	1.0M	~1K	$0.15	$0.60
17	GPT-4.1	OpenAI	1.0M	~1K	$2.00	$8.00
18	GPT-4.1 Mini	OpenAI	1.0M	~1K	$0.40	$1.60
19	GPT-4.1 Nano	OpenAI	1.0M	~1K	$0.10	$0.40
20	Palmyra X5	Writer	1.0M	~1K	$0.60	$6.00
21	MiniMax-01	MiniMax	1.0M	~1K	$0.20	$1.10
22	Claude Opus 4.6	Anthropic	1M	~1K	$5.00	$25.00
23	Claude Sonnet 4	Anthropic	1M	~1K	$3.00	$15.00
24	Claude Sonnet 4.5	Anthropic	1M	~1K	$3.00	$15.00
25	Claude Sonnet 4.6	Anthropic	1M	~1K	$3.00	$15.00
26	MiniMax M1	MiniMax	1M	~1K	$0.40	$2.20
27	Nova 2 Lite	Amazon	1M	~1K	$0.30	$2.50
28	Nova Premier 1.0	Amazon	1M	~1K	$2.50	$12.50
29	Qwen Plus 0728	Alibaba	1M	~1K	$0.26	$0.78
30	Qwen Plus 0728 (thinking)	Alibaba	1M	~1K	$0.26	$0.78
31	Qwen-Plus	Alibaba	1M	~1K	$0.40	$1.20
32	Qwen3 Coder Flash	Alibaba	1M	~1K	$0.20	$0.97
33	Qwen3 Coder Plus	Alibaba	1M	~1K	$0.65	$3.25
34	Qwen3.5 Plus 2026-02-15	Alibaba	1M	~1K	$0.26	$1.56
35	Qwen3.5-Flash	Alibaba	1M	~1K	$0.10	$0.40

200K+ Tokens

(74 models)

Extended context models ideal for long documents, legal contracts, research papers, and multi-file code analysis.

#	Model	Provider	Context Window	Pages	Input / 1M	Output / 1M
1	GPT-5	OpenAI	400K	~533	$1.25	$10.00
2	GPT-5 Codex	OpenAI	400K	~533	$1.25	$10.00
3	GPT-5 Image	OpenAI	400K	~533	$10.00	$10.00
4	GPT-5 Image Mini	OpenAI	400K	~533	$2.50	$2.00
5	GPT-5 Mini	OpenAI	400K	~533	$0.25	$2.00
6	GPT-5 Nano	OpenAI	400K	~533	$0.05	$0.40
7	GPT-5 Pro	OpenAI	400K	~533	$15.00	$120.00
8	GPT-5.1	OpenAI	400K	~533	$1.25	$10.00
9	GPT-5.1-Codex	OpenAI	400K	~533	$1.25	$10.00
10	GPT-5.1-Codex-Max	OpenAI	400K	~533	$1.25	$10.00
11	GPT-5.1-Codex-Mini	OpenAI	400K	~533	$0.25	$2.00
12	GPT-5.2	OpenAI	400K	~533	$1.75	$14.00
13	GPT-5.2 Pro	OpenAI	400K	~533	$21.00	$168.00
14	GPT-5.2-Codex	OpenAI	400K	~533	$1.75	$14.00
15	GPT-5.3-Codex	OpenAI	400K	~533	$1.75	$14.00
16	Llama 4 Scout	Meta	328K	~437	$0.08	$0.30
17	Nova Lite 1.0	Amazon	300K	~400	$0.06	$0.24
18	Nova Pro 1.0	Amazon	300K	~400	$0.80	$3.20
19	Devstral 2 2512	Mistral AI	262K	~350	$0.40	$2.00
20	Kimi K2 0905 (exacto)	Moonshot AI	262K	~350	$0.60	$2.50
21	Kimi K2.5	Moonshot AI	262K	~350	$0.45	$2.20
22	MiMo-V2-Flash	Xiaomi	262K	~350	$0.09	$0.29
23	Ministral 3 14B 2512	Mistral AI	262K	~350	$0.20	$0.20
24	Ministral 3 8B 2512	Mistral AI	262K	~350	$0.15	$0.15
25	Mistral Large 3 2512	Mistral AI	262K	~350	$0.50	$1.50
26	Nemotron 3 Nano 30B A3B	NVIDIA	262K	~350	$0.05	$0.20
27	Qwen3 235B A22B Instruct 2507	Alibaba	262K	~350	$0.07	$0.10
28	Qwen3 30B A3B Instruct 2507	Alibaba	262K	~350	$0.09	$0.30
29	Qwen3 Coder 480B A35B	Alibaba	262K	~350	$0.22	$1.00
30	Qwen3 Coder 480B A35B (exacto)	Alibaba	262K	~350	$0.22	$1.80
31	Qwen3 Coder Next	Alibaba	262K	~350	$0.12	$0.75
32	Qwen3 Max	Alibaba	262K	~350	$1.20	$6.00
33	Qwen3 Max Thinking	Alibaba	262K	~350	$0.78	$3.90
34	Qwen3 Next 80B A3B Instruct	Alibaba	262K	~350	$0.09	$1.10
35	Qwen3 Next 80B A3B Instruct (free)	Alibaba	262K	~350	Free	Free
36	Qwen3 VL 235B A22B Instruct	Alibaba	262K	~350	$0.20	$0.88
37	Qwen3.5 397B A17B	Alibaba	262K	~350	$0.39	$2.34
38	Qwen3.5-122B-A10B	Alibaba	262K	~350	$0.26	$2.08
39	Qwen3.5-27B	Alibaba	262K	~350	$0.20	$1.56
40	Qwen3.5-35B-A3B	Alibaba	262K	~350	$0.16	$1.30
41	Seed 1.6	ByteDance	262K	~350	$0.25	$2.00
42	Seed 1.6 Flash	ByteDance	262K	~350	$0.07	$0.30
43	Seed-2.0-Mini	ByteDance	262K	~350	$0.10	$0.40
44	Qwen3 Coder 480B A35B (free)	Alibaba	262K	~349	Free	Free
45	Codestral 2508	Mistral AI	256K	~341	$0.30	$0.90
46	Command A	Cohere	256K	~341	$2.50	$10.00
47	Grok 4	xAI	256K	~341	$3.00	$15.00
48	Grok Code Fast 1	xAI	256K	~341	$0.20	$1.50
49	Jamba Large 1.7	AI21 Labs	256K	~341	$2.00	$8.00
50	KAT-Coder-Pro V1	Kuaishou	256K	~341	$0.21	$0.83
51	Nemotron 3 Nano 30B A3B (free)	NVIDIA	256K	~341	Free	Free
52	Step 3.5 Flash	StepFun	256K	~341	$0.10	$0.30
53	Step 3.5 Flash (free)	StepFun	256K	~341	Free	Free
54	Claude 3 Haiku	Anthropic	200K	~267	$0.25	$1.25
55	Claude 3.5 Haiku	Anthropic	200K	~267	$0.80	$4.00
56	Claude 3.5 Sonnet	Anthropic	200K	~267	$6.00	$30.00
57	Claude 3.7 Sonnet	Anthropic	200K	~267	$3.00	$15.00
58	Claude 3.7 Sonnet (thinking)	Anthropic	200K	~267	$3.00	$15.00
59	Claude Haiku 4.5	Anthropic	200K	~267	$1.00	$5.00
60	Claude Opus 4	Anthropic	200K	~267	$15.00	$75.00
61	Claude Opus 4.1	Anthropic	200K	~267	$15.00	$75.00
62	Claude Opus 4.5	Anthropic	200K	~267	$5.00	$25.00
63	o1	OpenAI	200K	~267	$15.00	$60.00
64	o1-pro	OpenAI	200K	~267	$150.00	$600.00
65	o3	OpenAI	200K	~267	$2.00	$8.00
66	o3 Deep Research	OpenAI	200K	~267	$10.00	$40.00
67	o3 Mini	OpenAI	200K	~267	$1.10	$4.40
68	o3 Mini High	OpenAI	200K	~267	$1.10	$4.40
69	o3 Pro	OpenAI	200K	~267	$20.00	$80.00
70	o4 Mini	OpenAI	200K	~267	$1.10	$4.40
71	o4 Mini Deep Research	OpenAI	200K	~267	$2.00	$8.00
72	o4 Mini High	OpenAI	200K	~267	$1.10	$4.40
73	Sonar Pro	Perplexity	200K	~267	$3.00	$15.00
74	Sonar Pro Search	Perplexity	200K	~267	$3.00	$15.00

128K Tokens

(115 models)

The current standard for frontier models. Sufficient for most production use cases, including long conversations and medium-length documents.

#	Model	Provider	Context Window	Pages	Input / 1M	Output / 1M
1	MiniMax M2	MiniMax	197K	~262	$0.26	$1.00
2	MiniMax M2.1	MiniMax	197K	~262	$0.27	$0.95
3	MiniMax M2.5	MiniMax	197K	~262	$0.29	$1.20
4	DeepSeek V3	DeepSeek	164K	~218	$0.32	$0.89
5	DeepSeek V3 0324	DeepSeek	164K	~218	$0.20	$0.77
6	DeepSeek V3.1 Terminus	DeepSeek	164K	~218	$0.21	$0.79
7	DeepSeek V3.1 Terminus (exacto)	DeepSeek	164K	~218	$0.21	$0.79
8	DeepSeek V3.2	DeepSeek	164K	~218	$0.25	$0.40
9	DeepSeek V3.2 Exp	DeepSeek	164K	~218	$0.27	$0.41
10	DeepSeek V3.2 Speciale	DeepSeek	164K	~218	$0.40	$1.20
11	Llama Guard 4 12B	Meta	164K	~218	$0.18	$0.18
12	R1 0528	DeepSeek	164K	~218	$0.45	$2.15
13	Qwen3 Coder 30B A3B Instruct	Alibaba	160K	~213	$0.07	$0.27
14	Aion-1.0	aion-labs	131K	~175	$4.00	$8.00
15	Aion-1.0-Mini	aion-labs	131K	~175	$0.70	$1.40
16	Aion-2.0	aion-labs	131K	~175	$0.80	$1.60
17	Devstral Medium	Mistral AI	131K	~175	$0.40	$2.00
18	Devstral Small 1.1	Mistral AI	131K	~175	$0.10	$0.30
19	ERNIE 4.5 21B A3B Thinking	Baidu	131K	~175	$0.07	$0.28
20	Gemma 3 12B	Google	131K	~175	$0.04	$0.13
21	Gemma 3 27B (free)	Google	131K	~175	Free	Free
22	Gemma 3 4B	Google	131K	~175	$0.04	$0.08
23	gpt-oss-120b	OpenAI	131K	~175	$0.04	$0.19
24	gpt-oss-120b (exacto)	OpenAI	131K	~175	$0.04	$0.19
25	gpt-oss-120b (free)	OpenAI	131K	~175	Free	Free
26	gpt-oss-20b	OpenAI	131K	~175	$0.03	$0.14
27	gpt-oss-20b (free)	OpenAI	131K	~175	Free	Free
28	gpt-oss-safeguard-20b	OpenAI	131K	~175	$0.07	$0.30
29	Grok 3	xAI	131K	~175	$3.00	$15.00
30	Grok 3 Beta	xAI	131K	~175	$3.00	$15.00
31	Grok 3 Mini	xAI	131K	~175	$0.30	$0.50
32	Grok 3 Mini Beta	xAI	131K	~175	$0.30	$0.50
33	Hunyuan A13B Instruct	Tencent	131K	~175	$0.14	$0.57
34	Kimi K2 0905	Moonshot AI	131K	~175	$0.40	$2.00
35	Kimi K2 Thinking	Moonshot AI	131K	~175	$0.47	$2.00
36	Llama 3.1 70B Instruct	Meta	131K	~175	$0.40	$0.40
37	Llama 3.1 Nemotron 70B Instruct	NVIDIA	131K	~175	$1.20	$1.20
38	Llama 3.2 11B Vision Instruct	Meta	131K	~175	$0.05	$0.05
39	Llama 3.2 3B Instruct (free)	Meta	131K	~175	Free	Free
40	Llama 3.3 70B Instruct	Meta	131K	~175	$0.10	$0.32
41	Llama 3.3 Nemotron Super 49B V1.5	NVIDIA	131K	~175	$0.10	$0.40
42	Llama Guard 3 8B	Meta	131K	~175	$0.02	$0.06
43	LongCat Flash Chat	Meituan	131K	~175	$0.20	$0.80
44	Maestro Reasoning	arcee-ai	131K	~175	$0.90	$3.30
45	Ministral 3 3B 2512	Mistral AI	131K	~175	$0.10	$0.10
46	Mistral Large 2407	Mistral AI	131K	~175	$2.00	$6.00
47	Mistral Large 2411	Mistral AI	131K	~175	$2.00	$6.00
48	Mistral Medium 3	Mistral AI	131K	~175	$0.40	$2.00
49	Mistral Medium 3.1	Mistral AI	131K	~175	$0.40	$2.00
50	Mistral Nemo	Mistral AI	131K	~175	$0.02	$0.04
51	Mistral Small 3.2 24B	Mistral AI	131K	~175	$0.06	$0.18
52	Nemotron Nano 12B 2 VL	NVIDIA	131K	~175	$0.20	$0.60
53	Nemotron Nano 9B V2	NVIDIA	131K	~175	$0.04	$0.16
54	Pixtral Large 2411	Mistral AI	131K	~175	$2.00	$6.00
55	Qwen VL Max	Alibaba	131K	~175	$0.80	$3.20
56	Qwen VL Plus	Alibaba	131K	~175	$0.14	$0.41
57	Qwen-Turbo	Alibaba	131K	~175	$0.03	$0.13
58	Qwen3 235B A22B	Alibaba	131K	~175	$0.45	$1.82
59	Qwen3 235B A22B Thinking 2507	Alibaba	131K	~175	Free	Free
60	Qwen3 VL 235B A22B Thinking	Alibaba	131K	~175	Free	Free
61	Qwen3 VL 30B A3B Instruct	Alibaba	131K	~175	$0.13	$0.52
62	Qwen3 VL 30B A3B Thinking	Alibaba	131K	~175	Free	Free
63	Qwen3 VL 32B Instruct	Alibaba	131K	~175	$0.10	$0.42
64	Qwen3 VL 8B Instruct	Alibaba	131K	~175	$0.08	$0.50
65	Qwen3 VL 8B Thinking	Alibaba	131K	~175	$0.12	$1.36
66	R1 Distill Llama 70B	DeepSeek	131K	~175	$0.70	$0.80
67	Spotlight	arcee-ai	131K	~175	$0.18	$0.18
68	Tongyi DeepResearch 30B A3B	Alibaba	131K	~175	$0.09	$0.45
69	Trinity Mini	arcee-ai	131K	~175	$0.04	$0.15
70	Trinity Mini (free)	arcee-ai	131K	~175	Free	Free
71	Virtuoso Large	arcee-ai	131K	~175	$0.75	$1.20
72	Granite 4.0 Micro	IBM	131K	~175	$0.02	$0.11
73	Kimi K2 0711	Moonshot AI	131K	~175	$0.55	$2.20
74	Llama 3.1 405B Instruct	Meta	131K	~175	$4.00	$4.00
75	Trinity Large Preview (free)	arcee-ai	131K	~175	Free	Free
76	Cogito v2.1 671B	deepcogito	128K	~171	$1.25	$1.25
77	Command R (08-2024)	Cohere	128K	~171	$0.15	$0.60
78	Command R+ (08-2024)	Cohere	128K	~171	$2.50	$10.00
79	Command R7B (12-2024)	Cohere	128K	~171	$0.04	$0.15
80	Gemma 3 27B	Google	128K	~171	$0.04	$0.15
81	GPT Audio	OpenAI	128K	~171	$2.50	$10.00
82	GPT Audio Mini	OpenAI	128K	~171	$0.60	$2.40
83	GPT-4 Turbo	OpenAI	128K	~171	$10.00	$30.00
84	GPT-4 Turbo (older v1106)	OpenAI	128K	~171	$10.00	$30.00
85	GPT-4 Turbo Preview	OpenAI	128K	~171	$10.00	$30.00
86	GPT-4o	OpenAI	128K	~171	$2.50	$10.00
87	GPT-4o (2024-05-13)	OpenAI	128K	~171	$5.00	$15.00
88	GPT-4o (2024-08-06)	OpenAI	128K	~171	$2.50	$10.00
89	GPT-4o (2024-11-20)	OpenAI	128K	~171	$2.50	$10.00
90	GPT-4o (extended)	OpenAI	128K	~171	$6.00	$18.00
91	GPT-4o Audio	OpenAI	128K	~171	$2.50	$10.00
92	GPT-4o Search Preview	OpenAI	128K	~171	$2.50	$10.00
93	GPT-4o-mini	OpenAI	128K	~171	$0.15	$0.60
94	GPT-4o-mini (2024-07-18)	OpenAI	128K	~171	$0.15	$0.60
95	GPT-4o-mini Search Preview	OpenAI	128K	~171	$0.15	$0.60
96	GPT-5 Chat	OpenAI	128K	~171	$1.25	$10.00
97	GPT-5.1 Chat	OpenAI	128K	~171	$1.25	$10.00
98	GPT-5.2 Chat	OpenAI	128K	~171	$1.75	$14.00
99	GPT-5.3 Chat	OpenAI	128K	~171	$1.75	$14.00
100	Llama 3.3 70B Instruct (free)	Meta	128K	~171	Free	Free
101	Mercury	Inception	128K	~171	$0.25	$0.75
102	Mercury Coder	Inception	128K	~171	$0.25	$0.75
103	Mistral Large	Mistral AI	128K	~171	$2.00	$6.00
104	Mistral Small 3.1 24B	Mistral AI	128K	~171	$0.35	$0.56
105	Mistral Small 3.1 24B (free)	Mistral AI	128K	~171	Free	Free
106	Nemotron Nano 12B 2 VL (free)	NVIDIA	128K	~171	Free	Free
107	Nemotron Nano 9B V2 (free)	NVIDIA	128K	~171	Free	Free
108	Nova Micro 1.0	Amazon	128K	~171	$0.04	$0.14
109	Olmo 2 32B Instruct	Allen AI	128K	~171	$0.05	$0.20
110	Qwen2.5 VL 32B Instruct	Alibaba	128K	~171	$0.20	$0.60
111	Qwen3 Next 80B A3B Thinking	Alibaba	128K	~171	$0.15	$1.20
112	Solar Pro 3	Upstage	128K	~171	$0.15	$0.60
113	Sonar Deep Research	Perplexity	128K	~171	$2.00	$8.00
114	Sonar Reasoning Pro	Perplexity	128K	~171	$2.00	$8.00
115	UI-TARS 7B	ByteDance	128K	~171	$0.10	$0.20

32K–64K Tokens

(51 models)

Moderate context windows suitable for shorter documents, code files, and focused conversations.

#	Model	Provider	Context Window	Pages	Input / 1M	Output / 1M
1	Sonar	Perplexity	127K	~169	$1.00	$1.00
2	ERNIE 4.5 300B A47B	Baidu	123K	~164	$0.28	$1.10
3	ERNIE 4.5 VL 424B A47B	Baidu	123K	~164	$0.42	$1.25
4	ERNIE 4.5 21B A3B	Baidu	120K	~160	$0.07	$0.28
5	Llama 3.2 3B Instruct	Meta	80K	~107	$0.05	$0.34
6	MiniMax M2-her	MiniMax	66K	~87	$0.30	$1.20
7	Mixtral 8x22B Instruct	Mistral AI	66K	~87	$2.00	$6.00
8	Nano Banana 2 (Gemini 3.1 Flash Image Preview)	Google	66K	~87	$0.50	$3.00
9	Nano Banana Pro (Gemini 3 Pro Image Preview)	Google	66K	~87	$2.00	$12.00
10	Olmo 3 32B Think	Allen AI	66K	~87	$0.15	$0.50
11	Olmo 3 7B Instruct	Allen AI	66K	~87	$0.10	$0.20
12	Olmo 3 7B Think	Allen AI	66K	~87	$0.12	$0.20
13	Olmo 3.1 32B Instruct	Allen AI	66K	~87	$0.20	$0.60
14	Olmo 3.1 32B Think	Allen AI	66K	~87	$0.15	$0.50
15	WizardLM-2 8x22B	Microsoft	66K	~87	$0.62	$0.62
16	R1	DeepSeek	64K	~85	$0.70	$2.50
17	Llama 3.2 1B Instruct	Meta	60K	~80	$0.03	$0.20
18	Qwen3 14B	Alibaba	41K	~55	$0.06	$0.24
19	Qwen3 30B A3B	Alibaba	41K	~55	$0.08	$0.28
20	Qwen3 32B	Alibaba	41K	~55	$0.08	$0.24
21	Qwen3 4B (free)	Alibaba	41K	~55	Free	Free
22	Qwen3 8B	Alibaba	41K	~55	$0.05	$0.40
23	Molmo2 8B	Allen AI	37K	~49	$0.20	$0.20
24	Coder Large	arcee-ai	33K	~44	$0.50	$0.80
25	DeepSeek V3.1	DeepSeek	33K	~44	$0.15	$0.75
26	Gemma 3 12B (free)	Google	33K	~44	Free	Free
27	Gemma 3 4B (free)	Google	33K	~44	Free	Free
28	Gemma 3n 4B	Google	33K	~44	$0.02	$0.04
29	LFM2-2.6B	Liquid AI	33K	~44	$0.01	$0.02
30	LFM2-24B-A2B	Liquid AI	33K	~44	$0.03	$0.12
31	LFM2-8B-A1B	Liquid AI	33K	~44	$0.01	$0.02
32	LFM2.5-1.2B-Instruct (free)	Liquid AI	33K	~44	Free	Free
33	LFM2.5-1.2B-Thinking (free)	Liquid AI	33K	~44	Free	Free
34	Llama 3.1 405B (base)	Meta	33K	~44	$4.00	$4.00
35	Mistral Small 3	Mistral AI	33K	~44	$0.05	$0.08
36	Mistral Small Creative	Mistral AI	33K	~44	$0.10	$0.30
37	Mixtral 8x7B Instruct	Mistral AI	33K	~44	$0.54	$0.54
38	Nano Banana (Gemini 2.5 Flash Image)	Google	33K	~44	$0.30	$2.50
39	Qwen-Max	Alibaba	33K	~44	$1.04	$4.16
40	Qwen2.5 72B Instruct	Alibaba	33K	~44	$0.12	$0.39
41	Qwen2.5 7B Instruct	Alibaba	33K	~44	$0.04	$0.10
42	Qwen2.5 Coder 32B Instruct	Alibaba	33K	~44	$0.20	$0.20
43	Qwen2.5 Coder 7B Instruct	Alibaba	33K	~44	$0.03	$0.09
44	Qwen2.5 VL 72B Instruct	Alibaba	33K	~44	$0.80	$0.80
45	Qwen2.5-VL 7B Instruct	Alibaba	33K	~44	$0.20	$0.20
46	Qwen3 30B A3B Thinking 2507	Alibaba	33K	~44	$0.05	$0.34
47	QwQ 32B	Alibaba	33K	~44	$0.15	$0.40
48	R1 Distill Qwen 32B	DeepSeek	33K	~44	$0.29	$0.29
49	Rnj 1 Instruct	essentialai	33K	~44	$0.15	$0.15
50	Saba	Mistral AI	33K	~44	$0.20	$0.60
51	Voxtral Small 24B 2507	Mistral AI	32K	~43	$0.10	$0.30

What Is a Context Window and Why Does It Matter?

Context window = working memory

A model's context window is the total number of tokens (roughly words) it can process in a single request. This includes both your input prompt and the model's output. A model with a 128K context window can process about 170 pages of text at once, while a 1M-token model can handle roughly 1,300 pages -- enough for entire books or large codebases.

Larger context enables new use cases

With small context windows (under 32K), you must chunk documents and use retrieval-augmented generation (RAG). Large context models eliminate this complexity for many workloads: analyzing full legal contracts, reviewing entire repositories, summarizing research paper collections, or maintaining very long conversations with full history retained.

Context size vs. effective recall

Not all context is created equal. Some models perform well on "needle in a haystack" tests at their full context length, while others degrade on information retrieval when prompts get very long. The advertised context window is the maximum, but effective performance may vary. Check our leaderboard for quality scores that account for real-world performance.

Cost implications of large context

Using a large context window means sending more tokens per request, which increases cost. For example, filling a 1M-token context at $3/1M input tokens costs $3 per request. For cost-sensitive workloads, consider whether RAG with a smaller context model might be more efficient than filling a large context window end to end.

Explore More

Dive deeper into model capabilities, compare context windows side by side, or see overall rankings across all dimensions.

Model Rankings|Context Window Comparison Tool|Compare Models Side by Side