by Alibaba · Rank #80 · Score 79.4
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts (MoE) design, improving inference efficiency. Compared with the Qwen3 series, they deliver a marked performance gain on both pure-text and multimodal tasks, offering fast response times while balancing inference speed against overall quality.
| Signal | Normalized | Weight | Contribution | Freshness |
|---|---|---|---|---|
| Capabilities (capability) | 83.3 | 20% | 16.7 | 2026-03-23T03:03:05.893Z |
| Benchmarks (benchmark) | 66.7 | 30% | 20.0 | 2026-03-23T03:03:05.893Z |
| Pricing (pricing_tier) | 0.3 | 15% | 0.0 | 2026-03-23T03:03:05.893Z |
| Context Window (context_window) | 95.2 | 10% | 9.5 | 2026-03-23T03:03:05.893Z |
| Recency (recency) | 100.0 | 15% | 15.0 | 2026-03-23T03:03:05.893Z |
| Output Capacity (output_capacity) | 80.3 | 10% | 8.0 | 2026-03-23T03:03:05.893Z |
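The signal rows above follow a simple weighted-sum pattern: each contribution appears to be the normalized value multiplied by the weight, rounded to one decimal. A minimal sketch in Python, assuming that formula (the signal keys and values are from the table; the rounding behavior is an assumption, and note the contributions sum to 69.2, not the headline score of 79.4, so the published score presumably applies further adjustment not shown here):

```python
# Recompute each signal's contribution from the table above.
# Assumed formula: contribution = normalized * weight, rounded to 1 decimal.
signals = {
    "capability":      (83.3, 0.20),
    "benchmark":       (66.7, 0.30),
    "pricing_tier":    (0.3,  0.15),
    "context_window":  (95.2, 0.10),
    "recency":         (100.0, 0.15),
    "output_capacity": (80.3, 0.10),
}

contributions = {
    name: round(normalized * weight, 1)
    for name, (normalized, weight) in signals.items()
}

for name, value in contributions.items():
    print(f"{name}: {value}")
# capability: 16.7, benchmark: 20.0, pricing_tier: 0.0, ...
```

Each computed value matches the Contribution column, which supports the weighted-sum reading of the table.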
| Capability | Supported |
|---|---|
| Vision | Yes |
| Reasoning | Yes |
| JSON Mode | Yes |
| Streaming | Yes |
| Function Calling | Yes |
| Web Search | No |