Compare 275 AI models with context windows of 32K tokens or more. The largest models support 2M tokens -- enough to process entire codebases, books, or hundreds of documents in a single prompt. Data updated hourly from OpenRouter.
The largest context windows available. These models can process entire codebases, full books, or hundreds of documents in a single request.
| # | Model | Provider | Context Window | Pages | Input / 1M | Output / 1M |
|---|---|---|---|---|---|---|
| 1 | Grok 4 Fast | xAI | 2M | ~3K | $0.20 | $0.50 |
| 2 | Grok 4.1 Fast | xAI | 2M | ~3K | $0.20 | $0.50 |
| 3 | Gemini 2.0 Flash | 1.0M | ~1K | $0.10 | $0.40 | |
| 4 | Gemini 2.0 Flash Lite | 1.0M | ~1K | $0.07 | $0.30 | |
| 5 | Gemini 2.5 Flash | 1.0M | ~1K | $0.30 | $2.50 | |
| 6 | Gemini 2.5 Flash Lite | 1.0M | ~1K | $0.10 | $0.40 | |
| 7 | Gemini 2.5 Flash Lite Preview 09-2025 | 1.0M | ~1K | $0.10 | $0.40 | |
| 8 | Gemini 2.5 Pro | 1.0M | ~1K | $1.25 | $10.00 | |
| 9 | Gemini 2.5 Pro Preview 05-06 | 1.0M | ~1K | $1.25 | $10.00 | |
| 10 | Gemini 2.5 Pro Preview 06-05 | 1.0M | ~1K | $1.25 | $10.00 | |
| 11 | Gemini 3 Flash Preview | 1.0M | ~1K | $0.50 | $3.00 | |
| 12 | Gemini 3 Pro Preview | 1.0M | ~1K | $2.00 | $12.00 | |
| 13 | Gemini 3.1 Flash Lite Preview | 1.0M | ~1K | $0.25 | $1.50 | |
| 14 | Gemini 3.1 Pro Preview | 1.0M | ~1K | $2.00 | $12.00 | |
| 15 | Gemini 3.1 Pro Preview Custom Tools | 1.0M | ~1K | $2.00 | $12.00 | |
| 16 | Llama 4 Maverick | Meta | 1.0M | ~1K | $0.15 | $0.60 |
| 17 | GPT-4.1 | OpenAI | 1.0M | ~1K | $2.00 | $8.00 |
| 18 | GPT-4.1 Mini | OpenAI | 1.0M | ~1K | $0.40 | $1.60 |
| 19 | GPT-4.1 Nano | OpenAI | 1.0M | ~1K | $0.10 | $0.40 |
| 20 | Palmyra X5 | Writer | 1.0M | ~1K | $0.60 | $6.00 |
| 21 | MiniMax-01 | MiniMax | 1.0M | ~1K | $0.20 | $1.10 |
| 22 | Claude Opus 4.6 | Anthropic | 1M | ~1K | $5.00 | $25.00 |
| 23 | Claude Sonnet 4 | Anthropic | 1M | ~1K | $3.00 | $15.00 |
| 24 | Claude Sonnet 4.5 | Anthropic | 1M | ~1K | $3.00 | $15.00 |
| 25 | Claude Sonnet 4.6 | Anthropic | 1M | ~1K | $3.00 | $15.00 |
| 26 | MiniMax M1 | MiniMax | 1M | ~1K | $0.40 | $2.20 |
| 27 | Nova 2 Lite | Amazon | 1M | ~1K | $0.30 | $2.50 |
| 28 | Nova Premier 1.0 | Amazon | 1M | ~1K | $2.50 | $12.50 |
| 29 | Qwen Plus 0728 | Alibaba | 1M | ~1K | $0.26 | $0.78 |
| 30 | Qwen Plus 0728 (thinking) | Alibaba | 1M | ~1K | $0.26 | $0.78 |
| 31 | Qwen-Plus | Alibaba | 1M | ~1K | $0.40 | $1.20 |
| 32 | Qwen3 Coder Flash | Alibaba | 1M | ~1K | $0.20 | $0.97 |
| 33 | Qwen3 Coder Plus | Alibaba | 1M | ~1K | $0.65 | $3.25 |
| 34 | Qwen3.5 Plus 2026-02-15 | Alibaba | 1M | ~1K | $0.26 | $1.56 |
| 35 | Qwen3.5-Flash | Alibaba | 1M | ~1K | $0.10 | $0.40 |
Extended context models ideal for long documents, legal contracts, research papers, and multi-file code analysis.
| # | Model | Provider | Context Window | Pages | Input / 1M | Output / 1M |
|---|---|---|---|---|---|---|
| 1 | GPT-5 | OpenAI | 400K | ~533 | $1.25 | $10.00 |
| 2 | GPT-5 Codex | OpenAI | 400K | ~533 | $1.25 | $10.00 |
| 3 | GPT-5 Image | OpenAI | 400K | ~533 | $10.00 | $10.00 |
| 4 | GPT-5 Image Mini | OpenAI | 400K | ~533 | $2.50 | $2.00 |
| 5 | GPT-5 Mini | OpenAI | 400K | ~533 | $0.25 | $2.00 |
| 6 | GPT-5 Nano | OpenAI | 400K | ~533 | $0.05 | $0.40 |
| 7 | GPT-5 Pro | OpenAI | 400K | ~533 | $15.00 | $120.00 |
| 8 | GPT-5.1 | OpenAI | 400K | ~533 | $1.25 | $10.00 |
| 9 | GPT-5.1-Codex | OpenAI | 400K | ~533 | $1.25 | $10.00 |
| 10 | GPT-5.1-Codex-Max | OpenAI | 400K | ~533 | $1.25 | $10.00 |
| 11 | GPT-5.1-Codex-Mini | OpenAI | 400K | ~533 | $0.25 | $2.00 |
| 12 | GPT-5.2 | OpenAI | 400K | ~533 | $1.75 | $14.00 |
| 13 | GPT-5.2 Pro | OpenAI | 400K | ~533 | $21.00 | $168.00 |
| 14 | GPT-5.2-Codex | OpenAI | 400K | ~533 | $1.75 | $14.00 |
| 15 | GPT-5.3-Codex | OpenAI | 400K | ~533 | $1.75 | $14.00 |
| 16 | Llama 4 Scout | Meta | 328K | ~437 | $0.08 | $0.30 |
| 17 | Nova Lite 1.0 | Amazon | 300K | ~400 | $0.06 | $0.24 |
| 18 | Nova Pro 1.0 | Amazon | 300K | ~400 | $0.80 | $3.20 |
| 19 | Devstral 2 2512 | Mistral AI | 262K | ~350 | $0.40 | $2.00 |
| 20 | Kimi K2 0905 (exacto) | Moonshot AI | 262K | ~350 | $0.60 | $2.50 |
| 21 | Kimi K2.5 | Moonshot AI | 262K | ~350 | $0.45 | $2.20 |
| 22 | MiMo-V2-Flash | Xiaomi | 262K | ~350 | $0.09 | $0.29 |
| 23 | Ministral 3 14B 2512 | Mistral AI | 262K | ~350 | $0.20 | $0.20 |
| 24 | Ministral 3 8B 2512 | Mistral AI | 262K | ~350 | $0.15 | $0.15 |
| 25 | Mistral Large 3 2512 | Mistral AI | 262K | ~350 | $0.50 | $1.50 |
| 26 | Nemotron 3 Nano 30B A3B | NVIDIA | 262K | ~350 | $0.05 | $0.20 |
| 27 | Qwen3 235B A22B Instruct 2507 | Alibaba | 262K | ~350 | $0.07 | $0.10 |
| 28 | Qwen3 30B A3B Instruct 2507 | Alibaba | 262K | ~350 | $0.09 | $0.30 |
| 29 | Qwen3 Coder 480B A35B | Alibaba | 262K | ~350 | $0.22 | $1.00 |
| 30 | Qwen3 Coder 480B A35B (exacto) | Alibaba | 262K | ~350 | $0.22 | $1.80 |
| 31 | Qwen3 Coder Next | Alibaba | 262K | ~350 | $0.12 | $0.75 |
| 32 | Qwen3 Max | Alibaba | 262K | ~350 | $1.20 | $6.00 |
| 33 | Qwen3 Max Thinking | Alibaba | 262K | ~350 | $0.78 | $3.90 |
| 34 | Qwen3 Next 80B A3B Instruct | Alibaba | 262K | ~350 | $0.09 | $1.10 |
| 35 | Qwen3 Next 80B A3B Instruct (free) | Alibaba | 262K | ~350 | Free | Free |
| 36 | Qwen3 VL 235B A22B Instruct | Alibaba | 262K | ~350 | $0.20 | $0.88 |
| 37 | Qwen3.5 397B A17B | Alibaba | 262K | ~350 | $0.39 | $2.34 |
| 38 | Qwen3.5-122B-A10B | Alibaba | 262K | ~350 | $0.26 | $2.08 |
| 39 | Qwen3.5-27B | Alibaba | 262K | ~350 | $0.20 | $1.56 |
| 40 | Qwen3.5-35B-A3B | Alibaba | 262K | ~350 | $0.16 | $1.30 |
| 41 | Seed 1.6 | ByteDance | 262K | ~350 | $0.25 | $2.00 |
| 42 | Seed 1.6 Flash | ByteDance | 262K | ~350 | $0.07 | $0.30 |
| 43 | Seed-2.0-Mini | ByteDance | 262K | ~350 | $0.10 | $0.40 |
| 44 | Qwen3 Coder 480B A35B (free) | Alibaba | 262K | ~349 | Free | Free |
| 45 | Codestral 2508 | Mistral AI | 256K | ~341 | $0.30 | $0.90 |
| 46 | Command A | Cohere | 256K | ~341 | $2.50 | $10.00 |
| 47 | Grok 4 | xAI | 256K | ~341 | $3.00 | $15.00 |
| 48 | Grok Code Fast 1 | xAI | 256K | ~341 | $0.20 | $1.50 |
| 49 | Jamba Large 1.7 | AI21 Labs | 256K | ~341 | $2.00 | $8.00 |
| 50 | KAT-Coder-Pro V1 | Kuaishou | 256K | ~341 | $0.21 | $0.83 |
| 51 | Nemotron 3 Nano 30B A3B (free) | NVIDIA | 256K | ~341 | Free | Free |
| 52 | Step 3.5 Flash | StepFun | 256K | ~341 | $0.10 | $0.30 |
| 53 | Step 3.5 Flash (free) | StepFun | 256K | ~341 | Free | Free |
| 54 | Claude 3 Haiku | Anthropic | 200K | ~267 | $0.25 | $1.25 |
| 55 | Claude 3.5 Haiku | Anthropic | 200K | ~267 | $0.80 | $4.00 |
| 56 | Claude 3.5 Sonnet | Anthropic | 200K | ~267 | $6.00 | $30.00 |
| 57 | Claude 3.7 Sonnet | Anthropic | 200K | ~267 | $3.00 | $15.00 |
| 58 | Claude 3.7 Sonnet (thinking) | Anthropic | 200K | ~267 | $3.00 | $15.00 |
| 59 | Claude Haiku 4.5 | Anthropic | 200K | ~267 | $1.00 | $5.00 |
| 60 | Claude Opus 4 | Anthropic | 200K | ~267 | $15.00 | $75.00 |
| 61 | Claude Opus 4.1 | Anthropic | 200K | ~267 | $15.00 | $75.00 |
| 62 | Claude Opus 4.5 | Anthropic | 200K | ~267 | $5.00 | $25.00 |
| 63 | o1 | OpenAI | 200K | ~267 | $15.00 | $60.00 |
| 64 | o1-pro | OpenAI | 200K | ~267 | $150.00 | $600.00 |
| 65 | o3 | OpenAI | 200K | ~267 | $2.00 | $8.00 |
| 66 | o3 Deep Research | OpenAI | 200K | ~267 | $10.00 | $40.00 |
| 67 | o3 Mini | OpenAI | 200K | ~267 | $1.10 | $4.40 |
| 68 | o3 Mini High | OpenAI | 200K | ~267 | $1.10 | $4.40 |
| 69 | o3 Pro | OpenAI | 200K | ~267 | $20.00 | $80.00 |
| 70 | o4 Mini | OpenAI | 200K | ~267 | $1.10 | $4.40 |
| 71 | o4 Mini Deep Research | OpenAI | 200K | ~267 | $2.00 | $8.00 |
| 72 | o4 Mini High | OpenAI | 200K | ~267 | $1.10 | $4.40 |
| 73 | Sonar Pro | Perplexity | 200K | ~267 | $3.00 | $15.00 |
| 74 | Sonar Pro Search | Perplexity | 200K | ~267 | $3.00 | $15.00 |
The current standard for frontier models. Sufficient for most production use cases, including long conversations and medium-length documents.
| # | Model | Provider | Context Window | Pages | Input / 1M | Output / 1M |
|---|---|---|---|---|---|---|
| 1 | MiniMax M2 | MiniMax | 197K | ~262 | $0.26 | $1.00 |
| 2 | MiniMax M2.1 | MiniMax | 197K | ~262 | $0.27 | $0.95 |
| 3 | MiniMax M2.5 | MiniMax | 197K | ~262 | $0.29 | $1.20 |
| 4 | DeepSeek V3 | DeepSeek | 164K | ~218 | $0.32 | $0.89 |
| 5 | DeepSeek V3 0324 | DeepSeek | 164K | ~218 | $0.20 | $0.77 |
| 6 | DeepSeek V3.1 Terminus | DeepSeek | 164K | ~218 | $0.21 | $0.79 |
| 7 | DeepSeek V3.1 Terminus (exacto) | DeepSeek | 164K | ~218 | $0.21 | $0.79 |
| 8 | DeepSeek V3.2 | DeepSeek | 164K | ~218 | $0.25 | $0.40 |
| 9 | DeepSeek V3.2 Exp | DeepSeek | 164K | ~218 | $0.27 | $0.41 |
| 10 | DeepSeek V3.2 Speciale | DeepSeek | 164K | ~218 | $0.40 | $1.20 |
| 11 | Llama Guard 4 12B | Meta | 164K | ~218 | $0.18 | $0.18 |
| 12 | R1 0528 | DeepSeek | 164K | ~218 | $0.45 | $2.15 |
| 13 | Qwen3 Coder 30B A3B Instruct | Alibaba | 160K | ~213 | $0.07 | $0.27 |
| 14 | Aion-1.0 | aion-labs | 131K | ~175 | $4.00 | $8.00 |
| 15 | Aion-1.0-Mini | aion-labs | 131K | ~175 | $0.70 | $1.40 |
| 16 | Aion-2.0 | aion-labs | 131K | ~175 | $0.80 | $1.60 |
| 17 | Devstral Medium | Mistral AI | 131K | ~175 | $0.40 | $2.00 |
| 18 | Devstral Small 1.1 | Mistral AI | 131K | ~175 | $0.10 | $0.30 |
| 19 | ERNIE 4.5 21B A3B Thinking | Baidu | 131K | ~175 | $0.07 | $0.28 |
| 20 | Gemma 3 12B | 131K | ~175 | $0.04 | $0.13 | |
| 21 | Gemma 3 27B (free) | 131K | ~175 | Free | Free | |
| 22 | Gemma 3 4B | 131K | ~175 | $0.04 | $0.08 | |
| 23 | gpt-oss-120b | OpenAI | 131K | ~175 | $0.04 | $0.19 |
| 24 | gpt-oss-120b (exacto) | OpenAI | 131K | ~175 | $0.04 | $0.19 |
| 25 | gpt-oss-120b (free) | OpenAI | 131K | ~175 | Free | Free |
| 26 | gpt-oss-20b | OpenAI | 131K | ~175 | $0.03 | $0.14 |
| 27 | gpt-oss-20b (free) | OpenAI | 131K | ~175 | Free | Free |
| 28 | gpt-oss-safeguard-20b | OpenAI | 131K | ~175 | $0.07 | $0.30 |
| 29 | Grok 3 | xAI | 131K | ~175 | $3.00 | $15.00 |
| 30 | Grok 3 Beta | xAI | 131K | ~175 | $3.00 | $15.00 |
| 31 | Grok 3 Mini | xAI | 131K | ~175 | $0.30 | $0.50 |
| 32 | Grok 3 Mini Beta | xAI | 131K | ~175 | $0.30 | $0.50 |
| 33 | Hunyuan A13B Instruct | Tencent | 131K | ~175 | $0.14 | $0.57 |
| 34 | Kimi K2 0905 | Moonshot AI | 131K | ~175 | $0.40 | $2.00 |
| 35 | Kimi K2 Thinking | Moonshot AI | 131K | ~175 | $0.47 | $2.00 |
| 36 | Llama 3.1 70B Instruct | Meta | 131K | ~175 | $0.40 | $0.40 |
| 37 | Llama 3.1 Nemotron 70B Instruct | NVIDIA | 131K | ~175 | $1.20 | $1.20 |
| 38 | Llama 3.2 11B Vision Instruct | Meta | 131K | ~175 | $0.05 | $0.05 |
| 39 | Llama 3.2 3B Instruct (free) | Meta | 131K | ~175 | Free | Free |
| 40 | Llama 3.3 70B Instruct | Meta | 131K | ~175 | $0.10 | $0.32 |
| 41 | Llama 3.3 Nemotron Super 49B V1.5 | NVIDIA | 131K | ~175 | $0.10 | $0.40 |
| 42 | Llama Guard 3 8B | Meta | 131K | ~175 | $0.02 | $0.06 |
| 43 | LongCat Flash Chat | Meituan | 131K | ~175 | $0.20 | $0.80 |
| 44 | Maestro Reasoning | arcee-ai | 131K | ~175 | $0.90 | $3.30 |
| 45 | Ministral 3 3B 2512 | Mistral AI | 131K | ~175 | $0.10 | $0.10 |
| 46 | Mistral Large 2407 | Mistral AI | 131K | ~175 | $2.00 | $6.00 |
| 47 | Mistral Large 2411 | Mistral AI | 131K | ~175 | $2.00 | $6.00 |
| 48 | Mistral Medium 3 | Mistral AI | 131K | ~175 | $0.40 | $2.00 |
| 49 | Mistral Medium 3.1 | Mistral AI | 131K | ~175 | $0.40 | $2.00 |
| 50 | Mistral Nemo | Mistral AI | 131K | ~175 | $0.02 | $0.04 |
| 51 | Mistral Small 3.2 24B | Mistral AI | 131K | ~175 | $0.06 | $0.18 |
| 52 | Nemotron Nano 12B 2 VL | NVIDIA | 131K | ~175 | $0.20 | $0.60 |
| 53 | Nemotron Nano 9B V2 | NVIDIA | 131K | ~175 | $0.04 | $0.16 |
| 54 | Pixtral Large 2411 | Mistral AI | 131K | ~175 | $2.00 | $6.00 |
| 55 | Qwen VL Max | Alibaba | 131K | ~175 | $0.80 | $3.20 |
| 56 | Qwen VL Plus | Alibaba | 131K | ~175 | $0.14 | $0.41 |
| 57 | Qwen-Turbo | Alibaba | 131K | ~175 | $0.03 | $0.13 |
| 58 | Qwen3 235B A22B | Alibaba | 131K | ~175 | $0.45 | $1.82 |
| 59 | Qwen3 235B A22B Thinking 2507 | Alibaba | 131K | ~175 | Free | Free |
| 60 | Qwen3 VL 235B A22B Thinking | Alibaba | 131K | ~175 | Free | Free |
| 61 | Qwen3 VL 30B A3B Instruct | Alibaba | 131K | ~175 | $0.13 | $0.52 |
| 62 | Qwen3 VL 30B A3B Thinking | Alibaba | 131K | ~175 | Free | Free |
| 63 | Qwen3 VL 32B Instruct | Alibaba | 131K | ~175 | $0.10 | $0.42 |
| 64 | Qwen3 VL 8B Instruct | Alibaba | 131K | ~175 | $0.08 | $0.50 |
| 65 | Qwen3 VL 8B Thinking | Alibaba | 131K | ~175 | $0.12 | $1.36 |
| 66 | R1 Distill Llama 70B | DeepSeek | 131K | ~175 | $0.70 | $0.80 |
| 67 | Spotlight | arcee-ai | 131K | ~175 | $0.18 | $0.18 |
| 68 | Tongyi DeepResearch 30B A3B | Alibaba | 131K | ~175 | $0.09 | $0.45 |
| 69 | Trinity Mini | arcee-ai | 131K | ~175 | $0.04 | $0.15 |
| 70 | Trinity Mini (free) | arcee-ai | 131K | ~175 | Free | Free |
| 71 | Virtuoso Large | arcee-ai | 131K | ~175 | $0.75 | $1.20 |
| 72 | Granite 4.0 Micro | IBM | 131K | ~175 | $0.02 | $0.11 |
| 73 | Kimi K2 0711 | Moonshot AI | 131K | ~175 | $0.55 | $2.20 |
| 74 | Llama 3.1 405B Instruct | Meta | 131K | ~175 | $4.00 | $4.00 |
| 75 | Trinity Large Preview (free) | arcee-ai | 131K | ~175 | Free | Free |
| 76 | Cogito v2.1 671B | deepcogito | 128K | ~171 | $1.25 | $1.25 |
| 77 | Command R (08-2024) | Cohere | 128K | ~171 | $0.15 | $0.60 |
| 78 | Command R+ (08-2024) | Cohere | 128K | ~171 | $2.50 | $10.00 |
| 79 | Command R7B (12-2024) | Cohere | 128K | ~171 | $0.04 | $0.15 |
| 80 | Gemma 3 27B | 128K | ~171 | $0.04 | $0.15 | |
| 81 | GPT Audio | OpenAI | 128K | ~171 | $2.50 | $10.00 |
| 82 | GPT Audio Mini | OpenAI | 128K | ~171 | $0.60 | $2.40 |
| 83 | GPT-4 Turbo | OpenAI | 128K | ~171 | $10.00 | $30.00 |
| 84 | GPT-4 Turbo (older v1106) | OpenAI | 128K | ~171 | $10.00 | $30.00 |
| 85 | GPT-4 Turbo Preview | OpenAI | 128K | ~171 | $10.00 | $30.00 |
| 86 | GPT-4o | OpenAI | 128K | ~171 | $2.50 | $10.00 |
| 87 | GPT-4o (2024-05-13) | OpenAI | 128K | ~171 | $5.00 | $15.00 |
| 88 | GPT-4o (2024-08-06) | OpenAI | 128K | ~171 | $2.50 | $10.00 |
| 89 | GPT-4o (2024-11-20) | OpenAI | 128K | ~171 | $2.50 | $10.00 |
| 90 | GPT-4o (extended) | OpenAI | 128K | ~171 | $6.00 | $18.00 |
| 91 | GPT-4o Audio | OpenAI | 128K | ~171 | $2.50 | $10.00 |
| 92 | GPT-4o Search Preview | OpenAI | 128K | ~171 | $2.50 | $10.00 |
| 93 | GPT-4o-mini | OpenAI | 128K | ~171 | $0.15 | $0.60 |
| 94 | GPT-4o-mini (2024-07-18) | OpenAI | 128K | ~171 | $0.15 | $0.60 |
| 95 | GPT-4o-mini Search Preview | OpenAI | 128K | ~171 | $0.15 | $0.60 |
| 96 | GPT-5 Chat | OpenAI | 128K | ~171 | $1.25 | $10.00 |
| 97 | GPT-5.1 Chat | OpenAI | 128K | ~171 | $1.25 | $10.00 |
| 98 | GPT-5.2 Chat | OpenAI | 128K | ~171 | $1.75 | $14.00 |
| 99 | GPT-5.3 Chat | OpenAI | 128K | ~171 | $1.75 | $14.00 |
| 100 | Llama 3.3 70B Instruct (free) | Meta | 128K | ~171 | Free | Free |
| 101 | Mercury | Inception | 128K | ~171 | $0.25 | $0.75 |
| 102 | Mercury Coder | Inception | 128K | ~171 | $0.25 | $0.75 |
| 103 | Mistral Large | Mistral AI | 128K | ~171 | $2.00 | $6.00 |
| 104 | Mistral Small 3.1 24B | Mistral AI | 128K | ~171 | $0.35 | $0.56 |
| 105 | Mistral Small 3.1 24B (free) | Mistral AI | 128K | ~171 | Free | Free |
| 106 | Nemotron Nano 12B 2 VL (free) | NVIDIA | 128K | ~171 | Free | Free |
| 107 | Nemotron Nano 9B V2 (free) | NVIDIA | 128K | ~171 | Free | Free |
| 108 | Nova Micro 1.0 | Amazon | 128K | ~171 | $0.04 | $0.14 |
| 109 | Olmo 2 32B Instruct | Allen AI | 128K | ~171 | $0.05 | $0.20 |
| 110 | Qwen2.5 VL 32B Instruct | Alibaba | 128K | ~171 | $0.20 | $0.60 |
| 111 | Qwen3 Next 80B A3B Thinking | Alibaba | 128K | ~171 | $0.15 | $1.20 |
| 112 | Solar Pro 3 | Upstage | 128K | ~171 | $0.15 | $0.60 |
| 113 | Sonar Deep Research | Perplexity | 128K | ~171 | $2.00 | $8.00 |
| 114 | Sonar Reasoning Pro | Perplexity | 128K | ~171 | $2.00 | $8.00 |
| 115 | UI-TARS 7B | ByteDance | 128K | ~171 | $0.10 | $0.20 |
Moderate context windows suitable for shorter documents, code files, and focused conversations.
| # | Model | Provider | Context Window | Pages | Input / 1M | Output / 1M |
|---|---|---|---|---|---|---|
| 1 | Sonar | Perplexity | 127K | ~169 | $1.00 | $1.00 |
| 2 | ERNIE 4.5 300B A47B | Baidu | 123K | ~164 | $0.28 | $1.10 |
| 3 | ERNIE 4.5 VL 424B A47B | Baidu | 123K | ~164 | $0.42 | $1.25 |
| 4 | ERNIE 4.5 21B A3B | Baidu | 120K | ~160 | $0.07 | $0.28 |
| 5 | Llama 3.2 3B Instruct | Meta | 80K | ~107 | $0.05 | $0.34 |
| 6 | MiniMax M2-her | MiniMax | 66K | ~87 | $0.30 | $1.20 |
| 7 | Mixtral 8x22B Instruct | Mistral AI | 66K | ~87 | $2.00 | $6.00 |
| 8 | Nano Banana 2 (Gemini 3.1 Flash Image Preview) | 66K | ~87 | $0.50 | $3.00 | |
| 9 | Nano Banana Pro (Gemini 3 Pro Image Preview) | 66K | ~87 | $2.00 | $12.00 | |
| 10 | Olmo 3 32B Think | Allen AI | 66K | ~87 | $0.15 | $0.50 |
| 11 | Olmo 3 7B Instruct | Allen AI | 66K | ~87 | $0.10 | $0.20 |
| 12 | Olmo 3 7B Think | Allen AI | 66K | ~87 | $0.12 | $0.20 |
| 13 | Olmo 3.1 32B Instruct | Allen AI | 66K | ~87 | $0.20 | $0.60 |
| 14 | Olmo 3.1 32B Think | Allen AI | 66K | ~87 | $0.15 | $0.50 |
| 15 | WizardLM-2 8x22B | Microsoft | 66K | ~87 | $0.62 | $0.62 |
| 16 | R1 | DeepSeek | 64K | ~85 | $0.70 | $2.50 |
| 17 | Llama 3.2 1B Instruct | Meta | 60K | ~80 | $0.03 | $0.20 |
| 18 | Qwen3 14B | Alibaba | 41K | ~55 | $0.06 | $0.24 |
| 19 | Qwen3 30B A3B | Alibaba | 41K | ~55 | $0.08 | $0.28 |
| 20 | Qwen3 32B | Alibaba | 41K | ~55 | $0.08 | $0.24 |
| 21 | Qwen3 4B (free) | Alibaba | 41K | ~55 | Free | Free |
| 22 | Qwen3 8B | Alibaba | 41K | ~55 | $0.05 | $0.40 |
| 23 | Molmo2 8B | Allen AI | 37K | ~49 | $0.20 | $0.20 |
| 24 | Coder Large | arcee-ai | 33K | ~44 | $0.50 | $0.80 |
| 25 | DeepSeek V3.1 | DeepSeek | 33K | ~44 | $0.15 | $0.75 |
| 26 | Gemma 3 12B (free) | 33K | ~44 | Free | Free | |
| 27 | Gemma 3 4B (free) | 33K | ~44 | Free | Free | |
| 28 | Gemma 3n 4B | 33K | ~44 | $0.02 | $0.04 | |
| 29 | LFM2-2.6B | Liquid AI | 33K | ~44 | $0.01 | $0.02 |
| 30 | LFM2-24B-A2B | Liquid AI | 33K | ~44 | $0.03 | $0.12 |
| 31 | LFM2-8B-A1B | Liquid AI | 33K | ~44 | $0.01 | $0.02 |
| 32 | LFM2.5-1.2B-Instruct (free) | Liquid AI | 33K | ~44 | Free | Free |
| 33 | LFM2.5-1.2B-Thinking (free) | Liquid AI | 33K | ~44 | Free | Free |
| 34 | Llama 3.1 405B (base) | Meta | 33K | ~44 | $4.00 | $4.00 |
| 35 | Mistral Small 3 | Mistral AI | 33K | ~44 | $0.05 | $0.08 |
| 36 | Mistral Small Creative | Mistral AI | 33K | ~44 | $0.10 | $0.30 |
| 37 | Mixtral 8x7B Instruct | Mistral AI | 33K | ~44 | $0.54 | $0.54 |
| 38 | Nano Banana (Gemini 2.5 Flash Image) | 33K | ~44 | $0.30 | $2.50 | |
| 39 | Qwen-Max | Alibaba | 33K | ~44 | $1.04 | $4.16 |
| 40 | Qwen2.5 72B Instruct | Alibaba | 33K | ~44 | $0.12 | $0.39 |
| 41 | Qwen2.5 7B Instruct | Alibaba | 33K | ~44 | $0.04 | $0.10 |
| 42 | Qwen2.5 Coder 32B Instruct | Alibaba | 33K | ~44 | $0.20 | $0.20 |
| 43 | Qwen2.5 Coder 7B Instruct | Alibaba | 33K | ~44 | $0.03 | $0.09 |
| 44 | Qwen2.5 VL 72B Instruct | Alibaba | 33K | ~44 | $0.80 | $0.80 |
| 45 | Qwen2.5-VL 7B Instruct | Alibaba | 33K | ~44 | $0.20 | $0.20 |
| 46 | Qwen3 30B A3B Thinking 2507 | Alibaba | 33K | ~44 | $0.05 | $0.34 |
| 47 | QwQ 32B | Alibaba | 33K | ~44 | $0.15 | $0.40 |
| 48 | R1 Distill Qwen 32B | DeepSeek | 33K | ~44 | $0.29 | $0.29 |
| 49 | Rnj 1 Instruct | essentialai | 33K | ~44 | $0.15 | $0.15 |
| 50 | Saba | Mistral AI | 33K | ~44 | $0.20 | $0.60 |
| 51 | Voxtral Small 24B 2507 | Mistral AI | 32K | ~43 | $0.10 | $0.30 |
A model's context window is the total number of tokens (roughly words) it can process in a single request. This includes both your input prompt and the model's output. A model with a 128K context window can process about 170 pages of text at once, while a 1M-token model can handle roughly 1,300 pages -- enough for entire books or large codebases.
With small context windows (under 32K), you must chunk documents and use retrieval-augmented generation (RAG). Large context models eliminate this complexity for many workloads: analyzing full legal contracts, reviewing entire repositories, summarizing research paper collections, or maintaining very long conversations with full history retained.
Not all context is created equal. Some models perform well on "needle in a haystack" tests at their full context length, while others degrade on information retrieval when prompts get very long. The advertised context window is the maximum, but effective performance may vary. Check our leaderboard for quality scores that account for real-world performance.
Using a large context window means sending more tokens per request, which increases cost. For example, filling a 1M-token context at $3/1M input tokens costs $3 per request. For cost-sensitive workloads, consider whether RAG with a smaller context model might be more efficient than filling a large context window end to end.
Dive deeper into model capabilities, compare context windows side by side, or see overall rankings across all dimensions.