AI models ranked by multilingual performance using MMLU benchmark scores across languages. Find the best LLM for translation and non-English tasks.
Top model: Gemini 2.5 Pro Preview 05-06 (score: 91)
70.5 across all ranked models
58 models with benchmark data
Each model's score is a weighted average of its available benchmark results. When a model is missing some benchmarks, the weights are re-normalized across the benchmarks that are available. All scores are on a 0-100 scale. Data sourced from official model cards, published papers, and third-party evaluation platforms.
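The re-normalization described above can be sketched in a few lines of Python. This is a minimal illustration, not the site's actual implementation: the function name `weighted_score`, the benchmark keys, and the 0.6/0.4 weights are all hypothetical, since the real weights are not published here.

```python
from typing import Mapping, Optional


def weighted_score(
    scores: Mapping[str, Optional[float]],
    weights: Mapping[str, float],
) -> Optional[float]:
    """Weighted average over the benchmarks a model actually has.

    Scores are assumed to be on a 0-100 scale already. A missing benchmark
    is represented by None, and its weight is dropped so the remaining
    weights are re-normalized to sum to 1.
    """
    # Keep only benchmarks that have a score and a known weight.
    available = {
        name: score
        for name, score in scores.items()
        if score is not None and name in weights
    }
    total_weight = sum(weights[name] for name in available)
    if total_weight == 0:
        return None  # no usable benchmark data for this model
    # Dividing by the sum of available weights is the re-normalization step.
    return sum(weights[name] * score for name, score in available.items()) / total_weight


# Hypothetical weights, chosen only for illustration.
WEIGHTS = {"mmlu": 0.6, "arena_elo": 0.4}

full = weighted_score({"mmlu": 90.0, "arena_elo": 80.0}, WEIGHTS)
partial = weighted_score({"mmlu": 90.0, "arena_elo": None}, WEIGHTS)
```

With both benchmarks present, `full` is the plain weighted average (0.6 × 90 + 0.4 × 80 = 86). With Arena Elo missing, `partial` falls back to the MMLU score alone (90), because the single remaining weight is re-normalized to 1.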
Based on our benchmark analysis, Gemini 2.5 Pro Preview 05-06 by Google is currently the #1 ranked model for multilingual tasks, with a weighted score of 91/100.
Models are ranked by a weighted average of their MMLU and Arena Elo benchmark scores. All scores are normalized to a 0-100 scale.
We currently rank 58 models that have relevant benchmark data for multilingual tasks.