Signal Breakdown Explorer

Understand how composite scores are built from individual signals. Each signal measures a distinct quality dimension, weighted and combined to produce the final score for every model.

Signal Overview

High-level summary of the signal data across all scored models.

Signals Tracked

unique quality dimensions

Models with Signal Data

293

of 293 total models

Avg Signals per Model

6.0

signals per model on average

Signal Importance Rankings

Signals ranked by their average weight in the composite score calculation. Higher weight means greater influence on the final score.

Signal	Avg Weight	Avg Score	Avg Contribution	Max Contribution	Top Model
Capabilities	25.0%	50.9	12.73	21.40	GPT-5.4 Pro(21.4)
Pricing Tier	25.0%	8.7	2.18	25.00	GPT-5.4 Pro(25.0)
Context Window	15.0%	81.7	12.26	15.00	Grok 4.1 Fast(15.0)
Recency	15.0%	76.3	11.44	15.00	GPT-5.4 Pro(15.0)
Output Capacity	10.0%	59.1	5.90	10.00	MiniMax-01(10.0)
Versatility	10.0%	46.1	4.60	10.00	Gemini 3.1 Pro Preview Custom Tools(10.0)

Signal Contribution Breakdown

Top 10 models by composite score with stacked signal contributions. Each colored segment is proportional to that signal's contribution to the total score.

Capabilities

Pricing Tier

Context Window

Recency

Output Capacity

Versatility

GPT-5.4 Pro90.9

GPT-5.2 Pro89.9

GPT-5 Pro89.9

o3 Pro81.6

Claude Opus 4.181.1

o1-pro77.2

Claude Opus 475.5

o3 Deep Research74.0

Claude Opus 4.670.5

Claude Opus 4.570.0

Signal Leaders

For each signal, the top 5 models ranked by that signal's contribution to their composite score.

Capabilities

Pricing Tier

1.GPT-5.4 Pro25.0
2.GPT-5.2 Pro25.0
3.GPT-5 Pro25.0
4.o1-pro25.0
5.o3 Pro20.0

Context Window

Recency

Output Capacity

Versatility

1.Gemini 3.1 Pro Preview Custom Tools10.0
2.Gemini 3.1 Pro Preview10.0
3.Gemini 3 Pro Preview10.0
4.Gemini 3 Flash Preview10.0
5.Gemini 3.1 Flash Lite Preview10.0

Signal Correlations

Which signals tend to move together? Pearson correlation coefficient between signal scores across all models. Values near +1 indicate signals that rise and fall together; values near -1 indicate inverse relationships.

Most Correlated Pairs

Capabilities ↔ Versatility+0.569

Capabilities ↔ Context Window+0.558

Context Window ↔ Versatility+0.514

Context Window ↔ Recency+0.469

Output Capacity ↔ Versatility+0.366

Least Correlated Pairs

Pricing Tier ↔ Recency-0.000

Context Window ↔ Pricing Tier+0.051

Output Capacity ↔ Recency+0.090

Output Capacity ↔ Pricing Tier+0.139

Recency ↔ Versatility+0.159

Methodology

How signals work and contribute to the composite score.

What Signals Represent

Signals are individual quality dimensions that capture different aspects of a model's value. Each signal measures a specific attribute such as benchmark performance, pricing efficiency, context capacity, or capability breadth. Together, they provide a multi-dimensional view of model quality.

How Weights Are Applied

Each signal is assigned a weight reflecting its importance in the overall assessment. Weights are expressed as fractions summing to 1.0 (100%). A signal with weight 0.25 contributes up to 25% of the composite score. Weights are calibrated based on the signal's relevance to practical model quality.

Normalized Score (0–100)

Each signal's raw value is normalized to a 0–100 scale to make signals comparable regardless of their original units. A score of 100 means the model ranks at the top for that signal, while 0 indicates the lowest possible performance. Z-scores are computed first, then mapped to the 0–100 range.

How Contribution Works

A signal's contribution equals its weight multiplied by its normalized score. For example, a signal with weight 0.25 and normalized score 80 contributes 20 points to the composite score. The sum of all contributions gives the final composite score. This makes it easy to see which signals drive each model's ranking.

Explore More

Continue exploring AI model data with benchmarks, the full leaderboard, and other explorer views.

All Explorers Benchmarks Leaderboard

Signal Importance Rankings

Signals ranked by their average weight in the composite score calculation. Higher weight means greater influence on the final score.

Signal	Avg Weight	Avg Score	Avg Contribution	Max Contribution	Top Model
Capabilities	25.0%	50.9	12.73	21.40	GPT-5.4 Pro(21.4)
Pricing Tier	25.0%	8.7	2.18	25.00	GPT-5.4 Pro(25.0)
Context Window	15.0%	81.7	12.26	15.00	Grok 4.1 Fast(15.0)
Recency	15.0%	76.3	11.44	15.00	GPT-5.4 Pro(15.0)
Output Capacity	10.0%	59.1	5.90	10.00	MiniMax-01(10.0)
Versatility	10.0%	46.1	4.60	10.00	Gemini 3.1 Pro Preview Custom Tools(10.0)

Signal Contribution Breakdown

Top 10 models by composite score with stacked signal contributions. Each colored segment is proportional to that signal's contribution to the total score.

Capabilities

Pricing Tier

Context Window

Recency

Output Capacity

Versatility

Signal Correlations

Most Correlated Pairs

Capabilities ↔ Versatility+0.569

Capabilities ↔ Context Window+0.558

Context Window ↔ Versatility+0.514

Context Window ↔ Recency+0.469

Output Capacity ↔ Versatility+0.366

Least Correlated Pairs

Pricing Tier ↔ Recency-0.000

Context Window ↔ Pricing Tier+0.051

Output Capacity ↔ Recency+0.090

Output Capacity ↔ Pricing Tier+0.139

Recency ↔ Versatility+0.159

Methodology

How signals work and contribute to the composite score.