Mercury 2

Name: Mercury 2
Price: 0.25 USD
Rating: 52.8
Author: Inception

High confidence

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving >1,000 tokens/sec on standard GPUs. Mercury 2 is 5x+ faster than leading speed-optimized LLMs like Claude 4.5 Haiku and GPT 5 Mini, at a fraction of the cost. Mercury 2 supports tunable reasoning levels, 128K context, native tool use, and schema-aligned JSON output. Built for coding workflows where latency compounds, real-time voice/search, and agent loops. OpenAI API compatible. Read more in the [blog post](https://www.inceptionlabs.ai/blog/introducing-mercury-2).

Overall Score

Rank #99 in Coding

Quality Rank

#99

Benchmarks & expert evaluations

Adoption Rank

#99

Usage, search interest & community

128.0K token context

Released 2026-03-04

Compare

#99(could be #84–#114)

#1#115

Signal Overview

Score Breakdown

Signal	Strength	Weight	Impact	Updated
Recencyjust now	100	15%	+15.0	just now
Capabilitiesjust now	57	25%	+14.3	just now
Context Windowjust now	81	15%	+12.2	just now
Output Capacityjust now	78	10%	+7.8	just now
Versatilityjust now	33	10%	+3.3	just now
Pricing Tierjust now	1	25%	+0.2	just now

Data Freshness

capabilityJust now

Just now

Context window sizeJust now

Just now

output_capacityJust now

Just now

pricing_tierJust now

Just now

recencyJust now

Just now

versatilityJust now

Just now

Capabilities

Reasoning

Vision

Function Calling

JSON Mode

Streaming

Web Search

Image Output

Modalities

Input

text

Output

text

Frequently Asked Questions

Mercury 2 by Inception excels in the Coding category, where it ranks #99 with a composite score of 53/100. Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving >1,000 tokens/sec on standard GPUs. Mercury 2 is 5x+ faster than leading speed-optimized LLMs like Claude 4.5 Haiku and GPT 5 Mini, at a fraction of the cost. Mercury 2 supports tunable reasoning levels, 128K context, native tool use, and schema-aligned JSON output. Built for coding workflows where latency compounds, real-time voice/search, and agent loops. OpenAI API compatible. Read more in the [blog post](https://www.inceptionlabs.ai/blog/introducing-mercury-2). It is particularly strong in areas highlighted by its top benchmark performance and adoption metrics, making it suitable for both individual developers and enterprise teams looking for a reliable coding solution.

Mercury 2 is priced at $0.25 per million input tokens and $0.75 per million output tokens (USD). Contact the provider for volume discounts and enterprise pricing. Pricing is competitive within the coding category and reflects the model's quality-to-cost ratio.

In the Coding category, Mercury 2 holds rank #99 out of 6 models tracked. Its quality rank is #99 and adoption rank is #99. You can use our comparison tool at /compare to see detailed side-by-side metrics with specific alternatives. Key differentiators include its composite scoring across benchmarks, community sentiment, and real-world adoption rates.

Mercury 2 has been evaluated across 6 different signals. Its strongest areas include Capabilities (57/100), Context Window (81/100), Output Capacity (78/100). These scores are derived from industry-standard benchmarks, community ratings, and real-world performance metrics. The composite score of 53/100 reflects a weighted combination of all tracked signals.

Mercury 2 is a paid model, though some providers may offer trial credits or limited free tiers for evaluation. Check Inception's website for current free tier availability and promotional offers.

Key Info

ProviderInception

CategoryCoding

Rank#99

Score53/100

Context128.0K tokens

Released2026-03-04

Max Output50.0K tokens

LicenseOpen Source

Statusstable

Pricingper 1M tokens

Best value

90% cheaper than category average

Input

$0.25

-89% vs avg

Output

$0.75

-92% vs avg

Cost Estimator

Monthly tokens (millions)

Input: 70%Output: 30%

Est. monthly cost$4.00

Category average$43.23

You save $39.23/month vs category average

Get Started

Try this model Get API access

Docs Pricing

Why This Rank

+Recency

~Capabilities

+Context Window

+Output Capacity

Similar Models

View all

gpt-oss-safeguard-20bOpenAI

53#97

Mistral Small 3.2 24BMistral AI

53#98

Qwen3 Coder PlusAlibaba

52#100

Qwen3 Coder FlashAlibaba

52#101

Llama 4 MaverickMeta

52#102

See alternatives to Mercury 2

Mercury 2

by Inception

High confidence

Overall Score

Rank #99 in Coding

Quality Rank

#99

Benchmarks & expert evaluations

Adoption Rank

#99

Usage, search interest & community

128.0K token context

Released 2026-03-04

Compare

#99(could be #84–#114)

#1#115

Signal Overview

Score Breakdown

Signal	Strength	Weight	Impact	Updated
Recencyjust now	100	15%	+15.0	just now
Capabilitiesjust now	57	25%	+14.3	just now
Context Windowjust now	81	15%	+12.2	just now
Output Capacityjust now	78	10%	+7.8	just now
Versatilityjust now	33	10%	+3.3	just now
Pricing Tierjust now	1	25%	+0.2	just now

Data Freshness

capabilityJust now

Just now

Context window sizeJust now

Just now

output_capacityJust now

Just now

pricing_tierJust now

Just now

recencyJust now

Just now

versatilityJust now

Just now

Capabilities

Reasoning

Vision

Function Calling

JSON Mode

Streaming

Web Search

Image Output

Modalities

Input

text

Output

text

Frequently Asked Questions

Mercury 2 is a paid model, though some providers may offer trial credits or limited free tiers for evaluation. Check Inception's website for current free tier availability and promotional offers.