by Google
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.
| Signal | Strength | Weight | Impact |
|---|---|---|---|
| Capabilities2026-03-03T20:26:14.779Z | 71 | 25% | +17.9 |
| Context Window2026-03-03T20:26:14.779Z | 96 | 15% | +14.3 |
| Recency2026-03-03T20:26:14.779Z | 92 | 15% | +13.8 |
| Versatility2026-03-03T20:26:14.779Z | 100 | 10% | +10.0 |
| Output Capacity2026-03-03T20:26:14.779Z | 80 | 10% | +8.0 |
| Pricing Tier2026-03-03T20:26:14.779Z | 0 | 25% | +0.1 |
Cost Estimator
You save $38.89/month vs category average