Statistical analysis of how 300 AI model composite scores are distributed. Explore the mean, median, percentiles, and tier breakdowns to understand the AI model landscape.
Summary statistics across all 300 scored models.
Mean Score
68.2
±14.6 std dev
Median Score
70.8
Score Range
32–94
95th Percentile
86.9
Above Median
150
of 300 models
Number of models in each 10-point score bucket.
Models grouped by performance tier with summary statistics.
| Tier | Range | Count | % of Total |
|---|---|---|---|
| Elite | 90–100 | 11 | 3.7% |
| Strong | 70–89 | 141 | 47.0% |
| Average | 50–69 | 102 | 34.0% |
| Below Average | 30–49 | 37 | 12.3% |
| Weak | 0–29 | 0 | 0.0% |
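The tier boundaries above map directly to a simple lookup; a minimal sketch (tier names and ranges taken from the table, the helper name is illustrative):

```python
from collections import Counter

def tier_for(score: float) -> str:
    """Map a 0-100 composite score to its performance tier."""
    if score >= 90:
        return "Elite"
    if score >= 70:
        return "Strong"
    if score >= 50:
        return "Average"
    if score >= 30:
        return "Below Average"
    return "Weak"

# Count models per tier from a list of scores (values are hypothetical)
scores = [94, 85, 70.8, 59.3, 39.0, 32]
tier_counts = Counter(tier_for(s) for s in scores)
```

With the cutoffs written as lower bounds checked from highest to lowest, each score falls into exactly one tier.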
Score thresholds at key percentile levels.
| Percentile | Score |
|---|---|
| P5 | 39.0 |
| P10 | 44.7 |
| P25 | 59.3 |
| P50 | 70.8 |
| P75 | 80.3 |
| P90 | 85.0 |
| P95 | 86.9 |
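Thresholds like these come from a standard percentile function; a sketch using NumPy's `percentile` with its default linear interpolation (the score sample is illustrative, not the real dataset):

```python
import numpy as np

# Hypothetical sample of composite scores, already sorted for readability
scores = np.array([32.0, 39.0, 44.7, 59.3, 70.8, 80.3, 85.0, 86.9, 94.0])

levels = [5, 10, 25, 50, 75, 90, 95]
thresholds = {f"P{p}": round(float(np.percentile(scores, p)), 1) for p in levels}
```

P50 is simply the median, so with an odd-length sorted sample it equals the middle value.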
Providers with 3+ models, ranked by average composite score.
| Rank | Provider | Models | Avg Score |
|---|---|---|---|
| 1 | Xiaomi | 3 | 84.2 |
| 2 | ByteDance | 5 | 80.5 |
| 3 | xAI | 10 | 78.8 |
| 4 | Anthropic | 13 | 77.3 |
| 5 | OpenAI | 60 | 72.5 |
| 6 | MiniMax | 8 | 72.3 |
| 7 | Google | 23 | 71.9 |
| 8 | Moonshot AI | 4 | 71.5 |
| 9 | DeepSeek | 11 | 71.5 |
| 10 | NVIDIA | 11 | 70.6 |
How models are distributed across the top 20%, middle 60%, and bottom 20% of scores.
How scores are computed and what the distribution reveals.
Each model receives a composite score from 0 to 100, calculated as a weighted combination of six signals: capabilities (25%), pricing tier (25%), context window (15%), recency (15%), output capacity (10%), and versatility (10%). The score is designed to capture overall model quality and value in a single number.
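A weighted combination like this is just a dot product of the six normalized signals with their weights; a minimal sketch (signal names follow the text above, the example values are hypothetical):

```python
# Weights from the scoring description; they sum to 1.0
WEIGHTS = {
    "capabilities": 0.25,
    "pricing_tier": 0.25,
    "context_window": 0.15,
    "recency": 0.15,
    "output_capacity": 0.10,
    "versatility": 0.10,
}

def composite_score(signals: dict[str, float]) -> float:
    """Weighted sum of six signals, each normalized to 0-100."""
    return sum(WEIGHTS[name] * signals[name] for name in WEIGHTS)

# A hypothetical model with every signal at 80 scores 80 overall
model = {name: 80.0 for name in WEIGHTS}
```

Because the weights sum to 1, a model that is uniformly strong (or weak) across all six signals keeps that same value as its composite score.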
The score distribution reveals the competitive landscape of AI models. A tight cluster near the median suggests many similarly capable models, while a wide spread indicates clear differentiation between tiers. The shape of the distribution, its skew, and the gap between mean and median all provide insight into whether the market is top-heavy, bottom-heavy, or evenly distributed.
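The mean-median gap is a quick skew check: a mean below the median (68.2 vs 70.8 above) points to a left tail of low scorers pulling the average down. A sketch with Python's `statistics` module (the score sample is illustrative):

```python
import statistics

# Hypothetical sample with a low-score tail
scores = [32, 55, 62, 70, 72, 75, 81, 86, 94]

mean = statistics.fmean(scores)
median = statistics.median(scores)

# mean < median => left-skewed: a few low scores drag the mean down
skew_direction = (
    "left" if mean < median
    else "right" if mean > median
    else "symmetric"
)
```

For this sample the mean (about 69.7) sits below the median (72), mirroring the left skew in the distribution described above.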
Continue exploring AI model data with benchmarks, capabilities, and the full leaderboard.
The score distribution shows how all 300 tracked AI models are spread across the 0–100 SignalScore scale. Most models cluster in the 50–89 range, with a small elite group scoring above 90 and older or budget models falling below 50.
SignalScore is a composite metric combining six weighted factors: capability breadth (25%), pricing tier (25%), context window (15%), recency (15%), output capacity (10%), and versatility (10%). Each factor is normalized to a 0–100 scale before weighting.
Models scoring above the 75th percentile (a SignalScore of roughly 80+) are considered strong performers. The top 10% of models score above 85, while the median score across all tracked models sits at 70.8.