Weekly AI Performance Report

Week of March 23, 2026

140 models improved, 145 declined, and 15 unchanged this week across 300 tracked models.

Executive Summary

Biggest Gainer

GPT-5.4 Mini

+301 ranks

Currently ranked #3

Biggest Loser

GPT-5 Codex

-27 ranks

Currently ranked #43

New Entries

new models entered rankings this week

Weekly Movers

Top 10 models with the largest absolute rank changes over the past 7 days.

Model	Provider	Score	7d Change	Rank	State
GPT-5.4 MiniOpenAI	OpenAI	93.3	+301	#3	preliminary
MiMo-V2-OmniXiaomi	Xiaomi	85.0	+282	#22	preliminary
MiMo-V2-ProXiaomi	Xiaomi	85.0	+281	#23	preliminary
GPT-5.4 NanoOpenAI	OpenAI	85.0	+280	#24	preliminary
MiniMax M2.7MiniMax	MiniMax	83.0	+251	#53	preliminary
Mistral Small 4Mistral AI	Mistral AI	79.4	+225	#79	preliminary
GPT-5.1OpenAI	OpenAI	85.2	+29	#21	fragile
GPT-5 CodexOpenAI	OpenAI	85.0	-27	#43	fragile
Hunyuan A13B InstructTencent	Tencent	72.3	-27	#141	fragile
o4 MiniOpenAI	OpenAI	83.7	-23	#50	fragile

Top Gainers

Models with the largest positive rank improvements this week.

GPT-5.4 MiniOpenAI

+301

MiMo-V2-OmniXiaomi

+282

MiMo-V2-ProXiaomi

+281

GPT-5.4 NanoOpenAI

+280

MiniMax M2.7MiniMax

+251

Mistral Small 4Mistral AI

+225

GPT-5.1OpenAI

+29

Seed 1.6 FlashByteDance

+21

gpt-oss-20b (free)OpenAI

+21

Nemotron 3 SuperNVIDIA

+21

Top Losers

Models with the largest negative rank drops this week.

GPT-5 CodexOpenAI

-27

Hunyuan A13B InstructTencent

-27

o4 MiniOpenAI

-23

Ministral 3 14B 2512Mistral AI

-21

Nova 2 LiteAmazon

-21

Gemini 2.5 ProGoogle

-20

Qwen3 Next 80B A3B ThinkingAlibaba

-20

Grok 4 FastxAI

-19

ERNIE 4.5 21B A3B ThinkingBaidu

-19

GPT-5.1-Codex-MiniOpenAI

-18

New Models

10 new models entered the rankings this week.

Model	Provider	Score	Rank
GPT-5.4 MiniOpenAI	OpenAI	93.3	#3
MiMo-V2-OmniXiaomi	Xiaomi	85.0	#22
MiMo-V2-ProXiaomi	Xiaomi	85.0	#23
GPT-5.4 NanoOpenAI	OpenAI	85.0	#24
MiniMax M2.7MiniMax	MiniMax	83.0	#53
Mistral Small 4Mistral AI	Mistral AI	79.4	#79
GPT-5.1OpenAI	OpenAI	85.2	#21
Seed 1.6 FlashByteDance	ByteDance	85.0	#33
gpt-oss-20b (free)OpenAI	OpenAI	73.8	#117
Nemotron 3 SuperNVIDIA	NVIDIA	73.5	#120

Watch List

Models in a fragile state that might degrade further. Monitor closely.

Model	Provider	Score	7d Change	State
o3 ProOpenAI	OpenAI	87.7	+10	fragile
Grok 4xAI	xAI	85.8	+20	fragile
Grok 4.20 BetaxAI	xAI	85.7	+12	fragile
o3OpenAI	OpenAI	85.7	-6	fragile
GPT-5.1OpenAI	OpenAI	85.2	+29	fragile
Seed-2.0-LiteByteDance	ByteDance	85.0	+6	fragile
GPT-5.3-CodexOpenAI	OpenAI	85.0	+9	fragile
Qwen3.5 Plus 2026-02-15Alibaba	Alibaba	85.0	+9	fragile
Kimi K2.5Moonshot AI	Moonshot AI	85.0	+10	fragile
Seed 1.6 FlashByteDance	ByteDance	85.0	+21	fragile
Seed 1.6ByteDance	ByteDance	85.0	+15	fragile
GPT-5.1-Codex-MaxOpenAI	OpenAI	85.0	-13	fragile
GPT-5.1 ChatOpenAI	OpenAI	85.0	+10	fragile
GPT-5.1-CodexOpenAI	OpenAI	85.0	+11	fragile
GPT-5.1-Codex-MiniOpenAI	OpenAI	85.0	-18	fragile
Qwen3 VL 8B ThinkingAlibaba	Alibaba	85.0	-8	fragile
Qwen3 VL 30B A3B ThinkingAlibaba	Alibaba	85.0	+11	fragile
GPT-5 CodexOpenAI	OpenAI	85.0	-27	fragile
o4 Mini HighOpenAI	OpenAI	85.0	-15	fragile
Grok Code Fast 1xAI	xAI	84.8	+10	fragile
Gemini 2.5 ProGoogle	Google	84.8	-20	fragile
Gemini 2.5 Pro Preview 06-05Google	Google	84.3	+15	fragile
Gemini 2.5 Flash Lite Preview 09-2025Google	Google	83.7	+11	fragile
o4 MiniOpenAI	OpenAI	83.7	-23	fragile
MiniMax M2.5 (free)MiniMax	MiniMax	83.4	+6	fragile
Grok 4 FastxAI	xAI	83.3	-19	fragile
GPT-5.2 ChatOpenAI	OpenAI	82.9	+17	fragile
Qwen Plus 0728 (thinking)Alibaba	Alibaba	82.8	-14	fragile
Gemini 2.5 Pro Preview 05-06Google	Google	82.7	+14	fragile
MiMo-V2-FlashXiaomi	Xiaomi	82.6	-15	fragile
Trinity Miniarcee-ai	arcee-ai	82.4	-15	fragile
Nemotron Nano 12B 2 VL (free)NVIDIA	NVIDIA	82.3	-15	fragile
Grok 4.20 Multi-Agent BetaxAI	xAI	82.2	+17	fragile
Claude Opus 4.1Anthropic	Anthropic	82.0	-11	fragile
Gemini 3.1 Flash Lite PreviewGoogle	Google	81.9	+11	fragile
Qwen3.5 397B A17BAlibaba	Alibaba	81.8	-7	fragile
Claude Opus 4Anthropic	Anthropic	81.7	-11	fragile
Mercury 2Inception	Inception	81.3	+9	fragile
Qwen3 VL 32B InstructAlibaba	Alibaba	80.9	-12	fragile
Qwen3 VL 8B InstructAlibaba	Alibaba	80.9	+17	fragile
Qwen3 30B A3B Thinking 2507Alibaba	Alibaba	80.9	+9	fragile
Gemini 2.5 FlashGoogle	Google	80.1	-15	fragile
Claude Sonnet 4Anthropic	Anthropic	79.9	-11	fragile
Qwen3.5-122B-A10BAlibaba	Alibaba	79.7	+6	fragile
Qwen3.5-FlashAlibaba	Alibaba	79.4	-11	fragile
Qwen3.5-9BAlibaba	Alibaba	79.3	+14	fragile
Qwen3.5-27BAlibaba	Alibaba	79.1	-16	fragile
Qwen3 Coder PlusAlibaba	Alibaba	78.6	-7	fragile
Qwen3.5-35B-A3BAlibaba	Alibaba	78.3	+18	fragile
Step 3.5 Flash (free)StepFun	StepFun	78.2	-10	fragile
Nova Premier 1.0Amazon	Amazon	77.8	-8	fragile
KAT-Coder-Pro V1Kuaishou	Kuaishou	77.4	+16	fragile
Qwen3 VL 235B A22B ThinkingAlibaba	Alibaba	77.4	-10	fragile
GPT-4.1 MiniOpenAI	OpenAI	77.4	+15	fragile
DeepSeek V3.2 ExpDeepSeek	DeepSeek	77.2	-7	fragile
DeepSeek V3.2 SpecialeDeepSeek	DeepSeek	77.1	-9	fragile
Qwen Plus 0728Alibaba	Alibaba	77.0	+14	fragile
Qwen3 Coder NextAlibaba	Alibaba	76.7	-10	fragile
o1-proOpenAI	OpenAI	76.5	+16	fragile
MiniMax M2.5MiniMax	MiniMax	76.0	+14	fragile
Qwen3 MaxAlibaba	Alibaba	76.0	-12	fragile
o1OpenAI	OpenAI	75.7	-15	fragile
GPT-5 NanoOpenAI	OpenAI	75.6	+17	fragile
Qwen3 30B A3B Instruct 2507Alibaba	Alibaba	75.2	+6	fragile
ERNIE 4.5 VL 28B A3BBaidu	Baidu	75.0	+14	fragile
GPT-5 ChatOpenAI	OpenAI	75.0	+11	fragile
DeepSeek V3.2DeepSeek	DeepSeek	74.1	-14	fragile
Qwen3 VL 235B A22B InstructAlibaba	Alibaba	74.0	-7	fragile
DeepSeek V3.1DeepSeek	DeepSeek	73.8	-13	fragile
gpt-oss-120b (free)OpenAI	OpenAI	73.8	-12	fragile
gpt-oss-20b (free)OpenAI	OpenAI	73.8	+21	fragile
DeepSeek V3.1 TerminusDeepSeek	DeepSeek	73.7	+10	fragile
Grok 3xAI	xAI	73.7	+14	fragile
Nemotron 3 SuperNVIDIA	NVIDIA	73.5	+21	fragile
Ministral 3 14B 2512Mistral AI	Mistral AI	73.5	-21	fragile
Mistral Large 3 2512Mistral AI	Mistral AI	73.5	-7	fragile
o3 MiniOpenAI	OpenAI	73.4	+11	fragile
DeepSeek V3 0324DeepSeek	DeepSeek	73.2	-7	fragile
MiniMax M2.1MiniMax	MiniMax	73.1	+20	fragile
GPT-4o-mini Search PreviewOpenAI	OpenAI	72.9	-8	fragile
LongCat Flash ChatMeituan	Meituan	72.8	+16	fragile
Nova 2 LiteAmazon	Amazon	72.7	-21	fragile
MiniMax M2MiniMax	MiniMax	72.7	+19	fragile
Qwen3 Next 80B A3B ThinkingAlibaba	Alibaba	72.7	-20	fragile
Trinity Large Preview (free)arcee-ai	arcee-ai	72.6	+21	fragile
Ministral 3 3B 2512Mistral AI	Mistral AI	72.6	+21	fragile
Trinity Mini (free)arcee-ai	arcee-ai	72.6	+8	fragile
Nemotron Nano 12B 2 VLNVIDIA	NVIDIA	72.6	-15	fragile
Hunyuan A13B InstructTencent	Tencent	72.3	-27	fragile
GPT-4o AudioOpenAI	OpenAI	72.1	-13	fragile
Llama 4 ScoutMeta	Meta	72.0	-12	fragile
Nemotron Nano 9B V2NVIDIA	NVIDIA	71.6	-18	fragile
Qwen3 32BAlibaba	Alibaba	71.4	-16	fragile
Jamba Large 1.7AI21 Labs	AI21 Labs	71.2	-7	fragile
Mistral Medium 3.1Mistral AI	Mistral AI	70.3	+13	fragile
Qwen3 Next 80B A3B InstructAlibaba	Alibaba	70.1	+7	fragile
ERNIE 4.5 21B A3B ThinkingBaidu	Baidu	70.0	-19	fragile
Claude 3.7 Sonnet (thinking)Anthropic	Anthropic	69.8	+8	fragile
DeepSeek V3DeepSeek	DeepSeek	69.7	-17	fragile
Aion-2.0aion-labs	aion-labs	69.2	-9	fragile
Qwen3 Coder 480B A35B (free)Alibaba	Alibaba	69.0	+9	fragile
Llama 3.3 Nemotron Super 49B V1.5NVIDIA	NVIDIA	68.6	-8	fragile
GPT AudioOpenAI	OpenAI	68.4	-6	fragile
GPT Audio MiniOpenAI	OpenAI	68.4	-12	fragile
MiniMax M1MiniMax	MiniMax	68.4	+11	fragile
Nemotron 3 Nano 30B A3B (free)NVIDIA	NVIDIA	67.7	+16	fragile
gpt-oss-120bOpenAI	OpenAI	67.7	-12	fragile
Mistral Small 3.2 24BMistral AI	Mistral AI	67.3	+10	fragile
Cogito v2.1 671Bdeepcogito	deepcogito	66.7	+6	fragile
Olmo 3 32B ThinkAllen AI	Allen AI	66.3	-8	fragile
Grok 3 Mini BetaxAI	xAI	66.1	+9	fragile
Claude 3.5 SonnetAnthropic	Anthropic	65.8	-8	fragile
Llama 3.3 70B InstructMeta	Meta	65.7	-15	fragile
o3 Mini HighOpenAI	OpenAI	65.4	+21	fragile
ERNIE 4.5 21B A3BBaidu	Baidu	65.2	+15	fragile
Mistral Medium 3Mistral AI	Mistral AI	65.0	-6	fragile
Olmo 3.1 32B InstructAllen AI	Allen AI	64.9	-13	fragile
Codestral 2508Mistral AI	Mistral AI	64.8	-17	fragile
GPT-4o-miniOpenAI	OpenAI	64.6	+10	fragile
GPT-4oOpenAI	OpenAI	64.4	+6	fragile
Gemma 3 27BGoogle	Google	63.6	+11	fragile
GPT-4o (2024-11-20)OpenAI	OpenAI	63.3	+8	fragile
Sonar ProPerplexity	Perplexity	63.1	+8	fragile
Qwen3 4B (free)Alibaba	Alibaba	63.0	-16	fragile
Gemma 3 27B (free)Google	Google	62.8	-6	fragile
Kimi K2 0711Moonshot AI	Moonshot AI	62.7	-17	fragile
Devstral Small 1.1Mistral AI	Mistral AI	62.6	-11	fragile
Spotlightarcee-ai	arcee-ai	62.3	-15	fragile
Mistral Small 3.1 24B (free)Mistral AI	Mistral AI	62.2	+11	fragile
MiniMax-01MiniMax	MiniMax	62.0	-6	fragile
Sonar Reasoning ProPerplexity	Perplexity	61.6	+8	fragile
R1 Distill Llama 70BDeepSeek	DeepSeek	61.0	+8	fragile
Qwen VL PlusAlibaba	Alibaba	60.9	+8	fragile
Qwen2.5 VL 72B InstructAlibaba	Alibaba	60.3	+10	fragile
R1 Distill Qwen 32BDeepSeek	DeepSeek	60.2	+12	fragile
Gemma 2 27BGoogle	Google	59.7	-6	fragile
Phi 4Microsoft	Microsoft	59.6	+6	fragile
Mistral Small 3Mistral AI	Mistral AI	59.5	+6	fragile
MiniMax M2-herMiniMax	MiniMax	59.4	-8	fragile
LFM2.5-1.2B-Thinking (free)Liquid AI	Liquid AI	59.0	-7	fragile
Mistral Small CreativeMistral AI	Mistral AI	59.0	-17	fragile
Llama Guard 4 12BMeta	Meta	59.0	-17	fragile
Llama 3.1 Nemotron Ultra 253B v1NVIDIA	NVIDIA	57.5	-6	fragile
Qwen2.5 VL 32B InstructAlibaba	Alibaba	56.7	+15	fragile
Aion-1.0aion-labs	aion-labs	56.6	+15	fragile
Aion-1.0-Miniaion-labs	aion-labs	56.6	+6	fragile
Gemma 3 4BGoogle	Google	56.2	-12	fragile
GPT-4o (2024-08-06)OpenAI	OpenAI	55.6	+12	fragile
GPT-4o (extended)OpenAI	OpenAI	54.3	-10	fragile
Mistral LargeMistral AI	Mistral AI	54.1	-9	fragile
SonarPerplexity	Perplexity	53.7	+7	fragile
LFM2-8B-A1BLiquid AI	Liquid AI	53.2	+9	fragile
LFM2-2.6BLiquid AI	Liquid AI	53.2	-16	fragile
Mistral Large 2407Mistral AI	Mistral AI	53.0	-8	fragile
Claude 3 HaikuAnthropic	Anthropic	43.0	+9	fragile
Llama Guard 3 8BMeta	Meta	42.9	+6	fragile
GPT-4 Turbo (older v1106)OpenAI	OpenAI	42.7	+6	fragile
Llama 3.1 8B InstructMeta	Meta	42.4	-6	fragile
GPT-4OpenAI	OpenAI	39.0	+6	fragile

Leaderboard Snapshot

Top 10 AI models this week by composite score.

#	Model	Provider	Score	7d Change
1	GPT-5.4 ProOpenAI	OpenAI	94.0	0
2	GPT-5.4OpenAI	OpenAI	94.0	+4
3	GPT-5.4 MiniOpenAI	OpenAI	93.3	+301
4	GPT-5.2 ProOpenAI	OpenAI	92.7	-1
5	GPT-5.2OpenAI	OpenAI	92.7	+4
6	Claude Opus 4.6Anthropic	Anthropic	92.1	-1
7	GPT-5 ProOpenAI	OpenAI	91.9	0
8	o3 Deep ResearchOpenAI	OpenAI	91.5	-4
9	Claude Opus 4.5Anthropic	Anthropic	90.4	-1
10	Gemini 3 Pro PreviewGoogle	Google	90.3	+1

This Week's Numbers

Total Ranked

300

Avg Score

68.2

Top 10 Moved

8/10

Most Active Provider

OpenAI

Weekly AI Performance Report

Week of March 23, 2026

140 models improved, 145 declined, and 15 unchanged this week across 300 tracked models.

Executive Summary

Biggest Gainer

GPT-5.4 Mini

+301 ranks

Currently ranked #3

Biggest Loser

GPT-5 Codex

-27 ranks

Currently ranked #43

New Entries

new models entered rankings this week

Weekly Movers

Top 10 models with the largest absolute rank changes over the past 7 days.

Model	Provider	Score	7d Change	Rank	State
GPT-5.4 MiniOpenAI	OpenAI	93.3	+301	#3	preliminary
MiMo-V2-OmniXiaomi	Xiaomi	85.0	+282	#22	preliminary
MiMo-V2-ProXiaomi	Xiaomi	85.0	+281	#23	preliminary
GPT-5.4 NanoOpenAI	OpenAI	85.0	+280	#24	preliminary
MiniMax M2.7MiniMax	MiniMax	83.0	+251	#53	preliminary
Mistral Small 4Mistral AI	Mistral AI	79.4	+225	#79	preliminary
GPT-5.1OpenAI	OpenAI	85.2	+29	#21	fragile
GPT-5 CodexOpenAI	OpenAI	85.0	-27	#43	fragile
Hunyuan A13B InstructTencent	Tencent	72.3	-27	#141	fragile
o4 MiniOpenAI	OpenAI	83.7	-23	#50	fragile

Top Gainers

Models with the largest positive rank improvements this week.

GPT-5.4 MiniOpenAI

+301

MiMo-V2-OmniXiaomi

+282

MiMo-V2-ProXiaomi

+281

GPT-5.4 NanoOpenAI

+280

MiniMax M2.7MiniMax

+251

Mistral Small 4Mistral AI

+225

GPT-5.1OpenAI

+29

Seed 1.6 FlashByteDance

+21

gpt-oss-20b (free)OpenAI

+21

Nemotron 3 SuperNVIDIA

+21

Top Losers

Models with the largest negative rank drops this week.

GPT-5 CodexOpenAI

-27

Hunyuan A13B InstructTencent

-27

o4 MiniOpenAI

-23

Ministral 3 14B 2512Mistral AI

-21

Nova 2 LiteAmazon

-21

Gemini 2.5 ProGoogle

-20

Qwen3 Next 80B A3B ThinkingAlibaba

-20

Grok 4 FastxAI

-19

ERNIE 4.5 21B A3B ThinkingBaidu

-19

GPT-5.1-Codex-MiniOpenAI

-18

New Models

10 new models entered the rankings this week.

Model	Provider	Score	Rank
GPT-5.4 MiniOpenAI	OpenAI	93.3	#3
MiMo-V2-OmniXiaomi	Xiaomi	85.0	#22
MiMo-V2-ProXiaomi	Xiaomi	85.0	#23
GPT-5.4 NanoOpenAI	OpenAI	85.0	#24
MiniMax M2.7MiniMax	MiniMax	83.0	#53
Mistral Small 4Mistral AI	Mistral AI	79.4	#79
GPT-5.1OpenAI	OpenAI	85.2	#21
Seed 1.6 FlashByteDance	ByteDance	85.0	#33
gpt-oss-20b (free)OpenAI	OpenAI	73.8	#117
Nemotron 3 SuperNVIDIA	NVIDIA	73.5	#120

Watch List

Models in a fragile state that might degrade further. Monitor closely.

Model	Provider	Score	7d Change	State
o3 ProOpenAI	OpenAI	87.7	+10	fragile
Grok 4xAI	xAI	85.8	+20	fragile
Grok 4.20 BetaxAI	xAI	85.7	+12	fragile
o3OpenAI	OpenAI	85.7	-6	fragile
GPT-5.1OpenAI	OpenAI	85.2	+29	fragile
Seed-2.0-LiteByteDance	ByteDance	85.0	+6	fragile
GPT-5.3-CodexOpenAI	OpenAI	85.0	+9	fragile
Qwen3.5 Plus 2026-02-15Alibaba	Alibaba	85.0	+9	fragile
Kimi K2.5Moonshot AI	Moonshot AI	85.0	+10	fragile
Seed 1.6 FlashByteDance	ByteDance	85.0	+21	fragile
Seed 1.6ByteDance	ByteDance	85.0	+15	fragile
GPT-5.1-Codex-MaxOpenAI	OpenAI	85.0	-13	fragile
GPT-5.1 ChatOpenAI	OpenAI	85.0	+10	fragile
GPT-5.1-CodexOpenAI	OpenAI	85.0	+11	fragile
GPT-5.1-Codex-MiniOpenAI	OpenAI	85.0	-18	fragile
Qwen3 VL 8B ThinkingAlibaba	Alibaba	85.0	-8	fragile
Qwen3 VL 30B A3B ThinkingAlibaba	Alibaba	85.0	+11	fragile
GPT-5 CodexOpenAI	OpenAI	85.0	-27	fragile
o4 Mini HighOpenAI	OpenAI	85.0	-15	fragile
Grok Code Fast 1xAI	xAI	84.8	+10	fragile
Gemini 2.5 ProGoogle	Google	84.8	-20	fragile
Gemini 2.5 Pro Preview 06-05Google	Google	84.3	+15	fragile
Gemini 2.5 Flash Lite Preview 09-2025Google	Google	83.7	+11	fragile
o4 MiniOpenAI	OpenAI	83.7	-23	fragile
MiniMax M2.5 (free)MiniMax	MiniMax	83.4	+6	fragile
Grok 4 FastxAI	xAI	83.3	-19	fragile
GPT-5.2 ChatOpenAI	OpenAI	82.9	+17	fragile
Qwen Plus 0728 (thinking)Alibaba	Alibaba	82.8	-14	fragile
Gemini 2.5 Pro Preview 05-06Google	Google	82.7	+14	fragile
MiMo-V2-FlashXiaomi	Xiaomi	82.6	-15	fragile
Trinity Miniarcee-ai	arcee-ai	82.4	-15	fragile
Nemotron Nano 12B 2 VL (free)NVIDIA	NVIDIA	82.3	-15	fragile
Grok 4.20 Multi-Agent BetaxAI	xAI	82.2	+17	fragile
Claude Opus 4.1Anthropic	Anthropic	82.0	-11	fragile
Gemini 3.1 Flash Lite PreviewGoogle	Google	81.9	+11	fragile
Qwen3.5 397B A17BAlibaba	Alibaba	81.8	-7	fragile
Claude Opus 4Anthropic	Anthropic	81.7	-11	fragile
Mercury 2Inception	Inception	81.3	+9	fragile
Qwen3 VL 32B InstructAlibaba	Alibaba	80.9	-12	fragile
Qwen3 VL 8B InstructAlibaba	Alibaba	80.9	+17	fragile
Qwen3 30B A3B Thinking 2507Alibaba	Alibaba	80.9	+9	fragile
Gemini 2.5 FlashGoogle	Google	80.1	-15	fragile
Claude Sonnet 4Anthropic	Anthropic	79.9	-11	fragile
Qwen3.5-122B-A10BAlibaba	Alibaba	79.7	+6	fragile
Qwen3.5-FlashAlibaba	Alibaba	79.4	-11	fragile
Qwen3.5-9BAlibaba	Alibaba	79.3	+14	fragile
Qwen3.5-27BAlibaba	Alibaba	79.1	-16	fragile
Qwen3 Coder PlusAlibaba	Alibaba	78.6	-7	fragile
Qwen3.5-35B-A3BAlibaba	Alibaba	78.3	+18	fragile
Step 3.5 Flash (free)StepFun	StepFun	78.2	-10	fragile
Nova Premier 1.0Amazon	Amazon	77.8	-8	fragile
KAT-Coder-Pro V1Kuaishou	Kuaishou	77.4	+16	fragile
Qwen3 VL 235B A22B ThinkingAlibaba	Alibaba	77.4	-10	fragile
GPT-4.1 MiniOpenAI	OpenAI	77.4	+15	fragile
DeepSeek V3.2 ExpDeepSeek	DeepSeek	77.2	-7	fragile
DeepSeek V3.2 SpecialeDeepSeek	DeepSeek	77.1	-9	fragile
Qwen Plus 0728Alibaba	Alibaba	77.0	+14	fragile
Qwen3 Coder NextAlibaba	Alibaba	76.7	-10	fragile
o1-proOpenAI	OpenAI	76.5	+16	fragile
MiniMax M2.5MiniMax	MiniMax	76.0	+14	fragile
Qwen3 MaxAlibaba	Alibaba	76.0	-12	fragile
o1OpenAI	OpenAI	75.7	-15	fragile
GPT-5 NanoOpenAI	OpenAI	75.6	+17	fragile
Qwen3 30B A3B Instruct 2507Alibaba	Alibaba	75.2	+6	fragile
ERNIE 4.5 VL 28B A3BBaidu	Baidu	75.0	+14	fragile
GPT-5 ChatOpenAI	OpenAI	75.0	+11	fragile
DeepSeek V3.2DeepSeek	DeepSeek	74.1	-14	fragile
Qwen3 VL 235B A22B InstructAlibaba	Alibaba	74.0	-7	fragile
DeepSeek V3.1DeepSeek	DeepSeek	73.8	-13	fragile
gpt-oss-120b (free)OpenAI	OpenAI	73.8	-12	fragile
gpt-oss-20b (free)OpenAI	OpenAI	73.8	+21	fragile
DeepSeek V3.1 TerminusDeepSeek	DeepSeek	73.7	+10	fragile
Grok 3xAI	xAI	73.7	+14	fragile
Nemotron 3 SuperNVIDIA	NVIDIA	73.5	+21	fragile
Ministral 3 14B 2512Mistral AI	Mistral AI	73.5	-21	fragile
Mistral Large 3 2512Mistral AI	Mistral AI	73.5	-7	fragile
o3 MiniOpenAI	OpenAI	73.4	+11	fragile
DeepSeek V3 0324DeepSeek	DeepSeek	73.2	-7	fragile
MiniMax M2.1MiniMax	MiniMax	73.1	+20	fragile
GPT-4o-mini Search PreviewOpenAI	OpenAI	72.9	-8	fragile
LongCat Flash ChatMeituan	Meituan	72.8	+16	fragile
Nova 2 LiteAmazon	Amazon	72.7	-21	fragile
MiniMax M2MiniMax	MiniMax	72.7	+19	fragile
Qwen3 Next 80B A3B ThinkingAlibaba	Alibaba	72.7	-20	fragile
Trinity Large Preview (free)arcee-ai	arcee-ai	72.6	+21	fragile
Ministral 3 3B 2512Mistral AI	Mistral AI	72.6	+21	fragile
Trinity Mini (free)arcee-ai	arcee-ai	72.6	+8	fragile
Nemotron Nano 12B 2 VLNVIDIA	NVIDIA	72.6	-15	fragile
Hunyuan A13B InstructTencent	Tencent	72.3	-27	fragile
GPT-4o AudioOpenAI	OpenAI	72.1	-13	fragile
Llama 4 ScoutMeta	Meta	72.0	-12	fragile
Nemotron Nano 9B V2NVIDIA	NVIDIA	71.6	-18	fragile
Qwen3 32BAlibaba	Alibaba	71.4	-16	fragile
Jamba Large 1.7AI21 Labs	AI21 Labs	71.2	-7	fragile
Mistral Medium 3.1Mistral AI	Mistral AI	70.3	+13	fragile
Qwen3 Next 80B A3B InstructAlibaba	Alibaba	70.1	+7	fragile
ERNIE 4.5 21B A3B ThinkingBaidu	Baidu	70.0	-19	fragile
Claude 3.7 Sonnet (thinking)Anthropic	Anthropic	69.8	+8	fragile
DeepSeek V3DeepSeek	DeepSeek	69.7	-17	fragile
Aion-2.0aion-labs	aion-labs	69.2	-9	fragile
Qwen3 Coder 480B A35B (free)Alibaba	Alibaba	69.0	+9	fragile
Llama 3.3 Nemotron Super 49B V1.5NVIDIA	NVIDIA	68.6	-8	fragile
GPT AudioOpenAI	OpenAI	68.4	-6	fragile
GPT Audio MiniOpenAI	OpenAI	68.4	-12	fragile
MiniMax M1MiniMax	MiniMax	68.4	+11	fragile
Nemotron 3 Nano 30B A3B (free)NVIDIA	NVIDIA	67.7	+16	fragile
gpt-oss-120bOpenAI	OpenAI	67.7	-12	fragile
Mistral Small 3.2 24BMistral AI	Mistral AI	67.3	+10	fragile
Cogito v2.1 671Bdeepcogito	deepcogito	66.7	+6	fragile
Olmo 3 32B ThinkAllen AI	Allen AI	66.3	-8	fragile
Grok 3 Mini BetaxAI	xAI	66.1	+9	fragile
Claude 3.5 SonnetAnthropic	Anthropic	65.8	-8	fragile
Llama 3.3 70B InstructMeta	Meta	65.7	-15	fragile
o3 Mini HighOpenAI	OpenAI	65.4	+21	fragile
ERNIE 4.5 21B A3BBaidu	Baidu	65.2	+15	fragile
Mistral Medium 3Mistral AI	Mistral AI	65.0	-6	fragile
Olmo 3.1 32B InstructAllen AI	Allen AI	64.9	-13	fragile
Codestral 2508Mistral AI	Mistral AI	64.8	-17	fragile
GPT-4o-miniOpenAI	OpenAI	64.6	+10	fragile
GPT-4oOpenAI	OpenAI	64.4	+6	fragile
Gemma 3 27BGoogle	Google	63.6	+11	fragile
GPT-4o (2024-11-20)OpenAI	OpenAI	63.3	+8	fragile
Sonar ProPerplexity	Perplexity	63.1	+8	fragile
Qwen3 4B (free)Alibaba	Alibaba	63.0	-16	fragile
Gemma 3 27B (free)Google	Google	62.8	-6	fragile
Kimi K2 0711Moonshot AI	Moonshot AI	62.7	-17	fragile
Devstral Small 1.1Mistral AI	Mistral AI	62.6	-11	fragile
Spotlightarcee-ai	arcee-ai	62.3	-15	fragile
Mistral Small 3.1 24B (free)Mistral AI	Mistral AI	62.2	+11	fragile
MiniMax-01MiniMax	MiniMax	62.0	-6	fragile
Sonar Reasoning ProPerplexity	Perplexity	61.6	+8	fragile
R1 Distill Llama 70BDeepSeek	DeepSeek	61.0	+8	fragile
Qwen VL PlusAlibaba	Alibaba	60.9	+8	fragile
Qwen2.5 VL 72B InstructAlibaba	Alibaba	60.3	+10	fragile
R1 Distill Qwen 32BDeepSeek	DeepSeek	60.2	+12	fragile
Gemma 2 27BGoogle	Google	59.7	-6	fragile
Phi 4Microsoft	Microsoft	59.6	+6	fragile
Mistral Small 3Mistral AI	Mistral AI	59.5	+6	fragile
MiniMax M2-herMiniMax	MiniMax	59.4	-8	fragile
LFM2.5-1.2B-Thinking (free)Liquid AI	Liquid AI	59.0	-7	fragile
Mistral Small CreativeMistral AI	Mistral AI	59.0	-17	fragile
Llama Guard 4 12BMeta	Meta	59.0	-17	fragile
Llama 3.1 Nemotron Ultra 253B v1NVIDIA	NVIDIA	57.5	-6	fragile
Qwen2.5 VL 32B InstructAlibaba	Alibaba	56.7	+15	fragile
Aion-1.0aion-labs	aion-labs	56.6	+15	fragile
Aion-1.0-Miniaion-labs	aion-labs	56.6	+6	fragile
Gemma 3 4BGoogle	Google	56.2	-12	fragile
GPT-4o (2024-08-06)OpenAI	OpenAI	55.6	+12	fragile
GPT-4o (extended)OpenAI	OpenAI	54.3	-10	fragile
Mistral LargeMistral AI	Mistral AI	54.1	-9	fragile
SonarPerplexity	Perplexity	53.7	+7	fragile
LFM2-8B-A1BLiquid AI	Liquid AI	53.2	+9	fragile
LFM2-2.6BLiquid AI	Liquid AI	53.2	-16	fragile
Mistral Large 2407Mistral AI	Mistral AI	53.0	-8	fragile
Claude 3 HaikuAnthropic	Anthropic	43.0	+9	fragile
Llama Guard 3 8BMeta	Meta	42.9	+6	fragile
GPT-4 Turbo (older v1106)OpenAI	OpenAI	42.7	+6	fragile
Llama 3.1 8B InstructMeta	Meta	42.4	-6	fragile
GPT-4OpenAI	OpenAI	39.0	+6	fragile

Leaderboard Snapshot

Top 10 AI models this week by composite score.

#	Model	Provider	Score	7d Change
1	GPT-5.4 ProOpenAI	OpenAI	94.0	0
2	GPT-5.4OpenAI	OpenAI	94.0	+4
3	GPT-5.4 MiniOpenAI	OpenAI	93.3	+301
4	GPT-5.2 ProOpenAI	OpenAI	92.7	-1
5	GPT-5.2OpenAI	OpenAI	92.7	+4
6	Claude Opus 4.6Anthropic	Anthropic	92.1	-1
7	GPT-5 ProOpenAI	OpenAI	91.9	0
8	o3 Deep ResearchOpenAI	OpenAI	91.5	-4
9	Claude Opus 4.5Anthropic	Anthropic	90.4	-1
10	Gemini 3 Pro PreviewGoogle	Google	90.3	+1

This Week's Numbers

Total Ranked

300

Avg Score

68.2

Top 10 Moved

8/10

Most Active Provider

OpenAI

Weekly AI Performance Report

Executive Summary

Weekly Movers

Top Gainers

Top Losers

New Models

Watch List

Leaderboard Snapshot

This Week's Numbers

Read More Analysis

Weekly AI Performance Report

Executive Summary

Weekly Movers

Top Gainers

Top Losers

New Models

Watch List

Leaderboard Snapshot

This Week's Numbers

Read More Analysis