The top AI models for translation, ranked by quality and cost-effectiveness. Translation is volume-heavy — large documents, many language pairs, and real-time demands — so context window size, streaming support, and affordable pricing matter most. Compare the best LLM translation models for documents, websites, and multilingual content.
| # | Model | Provider | Score |
|---|---|---|---|
| 1 | GPT-5.4 Pro | OpenAI | 101 |
| 2 | GPT-5.2 Pro | OpenAI | 97 |
| 3 | GPT-5 Pro | OpenAI | 97 |
| 4 | o3 Pro | OpenAI | 89 |
| 5 | Claude Opus 4.1 | Anthropic | 88 |
| 6 | o1-pro | OpenAI | 84 |
| 7 | Claude Opus 4 | Anthropic | 83 |
| 8 | o3 Deep Research | OpenAI | 81 |
| 9 | Claude Opus 4.6 | Anthropic | 81 |
| 10 | GPT-5.4 | OpenAI | 80 |
| 11 | Claude Sonnet 4.5 | Anthropic | 79 |
| 12 | Gemini 3.1 Pro Preview Custom Tools | Google | 78 |
| 13 | Gemini 3.1 Pro Preview | Google | 78 |
| 14 | Gemini 3 Pro Preview | Google | 78 |
| 15 | Claude Sonnet 4.6 | Anthropic | 78 |
| 16 | Claude Opus 4.5 | Anthropic | 77 |
| 17 | Gemini 3 Flash Preview | Google | 76 |
| 18 | Gemini 3.1 Flash Lite Preview | Google | 76 |
| 19 | Gemini 2.5 Pro | Google | 76 |
| 20 | GPT-5.2 | OpenAI | 75 |
| 21 | Gemini 2.5 Flash Lite Preview 09-2025 | Google | 75 |
| 22 | GPT-5.1 | OpenAI | 74 |
| 23 | Gemini 2.5 Pro Preview 05-06 | Google | 74 |
| 24 | Gemini 2.5 Flash Lite | Google | 74 |
| 25 | Grok 4.1 Fast | xAI | 74 |
| 26 | Grok 4 Fast | xAI | 74 |
| 27 | GPT-5.3-Codex | OpenAI | 74 |
| 28 | GPT-5.2-Codex | OpenAI | 74 |
| 29 | Qwen3 VL 30B A3B Thinking | Alibaba | 74 |
| 30 | Qwen3 VL 235B A22B Thinking | Alibaba | 74 |
Traditional machine translation (like early Google Translate) works sentence by sentence. LLMs process entire documents at once, understanding context, tone, and intent across paragraphs. This produces translations that read naturally rather than sounding mechanical — especially for idiomatic expressions, humor, and culturally specific references.
Many words have multiple meanings depending on context. "Bank" can mean a financial institution or a river bank. LLMs use the surrounding text to disambiguate automatically. They also handle gendered languages, formal/informal registers, and domain-specific terminology far better than rule-based systems.
You can instruct an LLM to translate formally, casually, or for a specific audience. Need a legal contract translated with precise terminology? Or a marketing slogan localized for a specific culture? LLMs adapt to the target register in ways that traditional systems cannot.
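As a minimal sketch of this idea (the function name and prompt wording are our own, not from any specific SDK), register control can be as simple as a well-formed system prompt passed alongside the text:

```python
def build_translation_prompt(text: str, target_lang: str,
                             register: str = "neutral") -> list[dict]:
    """Build chat-style messages asking an LLM to translate `text` into
    `target_lang` using the requested register (e.g. formal, casual, legal)."""
    system = (
        f"You are a professional translator. Translate the user's text into "
        f"{target_lang}. Use a {register} register and preserve meaning, tone, "
        f"and domain-specific terminology."
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": text},
    ]

# Example: request a formal legal register for a contract sentence.
messages = build_translation_prompt(
    "Please review the attached contract.", "German",
    register="formal legal",
)
```

Swapping `register` for "casual marketing" or "plain language for children" is the entire change needed to retarget the output, which is the flexibility the section describes.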
A single LLM like GPT-4o or Claude handles hundreds of language pairs without switching systems. You can translate from Japanese to Portuguese, then Spanish to Mandarin, all through the same API. This simplifies architecture for apps that need to support many languages simultaneously.
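A toy comparison makes the architectural win concrete (the `translate` stub below is hypothetical; a real implementation would issue one chat request per job regardless of the pair):

```python
def translate(text: str, source: str, target: str) -> str:
    """Hypothetical stub for a single LLM translation endpoint. The point:
    every language pair goes through the same code path, no per-pair setup."""
    return f"[{source}->{target}] {text}"

# Any pair works without extra configuration:
jobs = [("こんにちは", "Japanese", "Portuguese"),
        ("Hola", "Spanish", "Mandarin")]
results = [translate(text, src, dst) for text, src, dst in jobs]

# Versus pair-specific MT engines: supporting n languages in both directions
# needs n*(n-1) directed engines, against a single LLM endpoint.
n = 10
engines_needed = n * (n - 1)  # 90 engines for just 10 languages
```

The quadratic blow-up in pair-specific engines is why consolidating on one model simplifies multilingual architecture so dramatically.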
For chat apps, live subtitles, or customer support, streaming matters most. Models with streaming support begin outputting translated text as they process, reducing perceived latency. Prioritize models that support streaming and deliver a fast time-to-first-token.
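The latency benefit can be sketched without any real API (the segments and lookup table below stand in for token deltas from a streaming response): the consumer sees output after the first segment instead of after the whole document.

```python
import time

FAKE_DICT = {"hola": "hello", "mundo": "world"}  # stand-in for model output

def stream_translation(segments):
    """Yield each translated segment as soon as it is ready, instead of
    returning the full translation in one final response."""
    for seg in segments:
        yield FAKE_DICT.get(seg, seg)

start = time.monotonic()
first_token_at = None
chunks = []
for chunk in stream_translation(["hola", "mundo"]):
    if first_token_at is None:
        # Perceived latency: the moment the first output arrives.
        first_token_at = time.monotonic() - start
    chunks.append(chunk)
translated = " ".join(chunks)
```

With a real model the same loop consumes server-sent chunks; time-to-first-token, not total generation time, is what the user feels in live settings.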
Translating long documents (contracts, manuals, books) requires large context windows. A 128K context window handles roughly 100 pages in one pass. For longer documents, look for models with 200K+ or 1M context. Single-pass translation preserves cross-references, terminology consistency, and tone throughout the document.
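A rough back-of-the-envelope check captures this sizing rule (the 500-tokens-per-page and ~20% output-expansion figures below are assumptions for dense English text, not measured values):

```python
def fits_in_one_pass(num_pages: int, context_tokens: int,
                     tokens_per_page: int = 500) -> bool:
    """Rough single-pass check: the input plus the expected translated
    output must both fit inside the model's context window."""
    input_tokens = num_pages * tokens_per_page
    output_tokens = input_tokens * 12 // 10  # translations often run ~20% longer
    return input_tokens + output_tokens <= context_tokens

# ~100 pages fits a 128K window in one pass; a 500-page book needs 1M-class context.
```

Under these assumptions, 100 pages comes to about 110K tokens round trip, just inside a 128K window, which is where the "roughly 100 pages" rule of thumb comes from.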
Translation workloads often involve millions of tokens — product catalogs, website localization, or user-generated content. For these, total cost per million tokens (input + output) dominates. Free and budget models work well for common language pairs. Reserve premium models for low-resource languages or content requiring nuanced quality.
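Budgeting these workloads is simple arithmetic; a small helper makes the comparison explicit (the per-million prices below are illustrative, not any provider's actual rates):

```python
def translation_cost(input_tokens: int, output_tokens: int,
                     input_price_per_m: float,
                     output_price_per_m: float) -> float:
    """Total cost in dollars given per-million-token prices for input
    and output. Check your provider's current rate card for real prices."""
    return ((input_tokens / 1e6) * input_price_per_m
            + (output_tokens / 1e6) * output_price_per_m)

# Example: localizing a catalog, 10M input + 12M output tokens,
# at hypothetical rates of $0.50 (input) and $1.50 (output) per million:
cost = translation_cost(10_000_000, 12_000_000, 0.50, 1.50)  # $23.00
```

Running the same volume through a model priced 10x higher scales the bill linearly, which is why per-million-token cost dominates model choice for bulk translation.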
For languages with less training data (e.g., Swahili, Khmer, Welsh), higher-quality models with larger parameter counts tend to perform significantly better. Budget models may produce acceptable results for English-French, but struggle with less common language pairs. Test with your target languages before committing.