The best AI models for data extraction, ranked by extraction score. JSON mode is critical for structured output, vision enables document and image reading和function calling powers pipeline integration。
231
JSON Mode
133
With Vision
223
Function Calling
237
128K+ Context
| # | Model | Score |
|---|---|---|
| 1 | GPT-5.4 ProOpenAI | 117 |
| 2 | GPT-5.4OpenAI | 117 |
| 3 | GPT-5.4 MiniOpenAI | 116 |
| 4 | GPT-5.2 ProOpenAI | 116 |
| 5 | GPT-5.2OpenAI | 116 |
| 6 | Claude Opus 4.6Anthropic | 115 |
| 7 | GPT-5 ProOpenAI | 115 |
| 8 | o3 Deep ResearchOpenAI | 115 |
| 9 | Claude Opus 4.5Anthropic | 113 |
| 10 | Gemini 3 Pro PreviewGoogle | 113 |
| 11 | GPT-5OpenAI | 113 |
| 12 | Gemini 3 Flash PreviewGoogle | 112 |
| 13 | Claude Sonnet 4.6Anthropic | 112 |
| 14 | Claude Sonnet 4.5Anthropic | 112 |
| 15 | o3 ProOpenAI | 111 |
| 16 | Grok 4.1 FastxAI | 110 |
| 17 | Grok 4xAI | 109 |
| 18 | Grok 4.20 BetaxAI | 109 |
| 19 | o3OpenAI | 109 |
| 20 | Gemini 3.1 Pro PreviewGoogle | 109 |
| 21 | GPT-5.1OpenAI | 108 |
| 22 | MiMo-V2-OmniXiaomi | 108 |
| 23 | GPT-5.4 NanoOpenAI | 108 |
| 24 | Seed-2.0-LiteByteDance | 108 |
| 25 | GPT-5.3 ChatOpenAI | 108 |
Extract structured data from PDFs, contracts, and reports. Models with vision can read scanned documents and handwritten text, while JSON mode ensures output is machine-parseable for downstream systems. Ideal for automating document intake pipelines.
Automatically parse invoices, receipts, and financial documents into structured fields -- vendor name, line items, totals, tax amounts, and dates. Vision-capable models handle photographed or scanned receipts with high accuracy.
Feed raw HTML or page text into an LLM to extract product details, pricing, reviews, or article metadata. JSON mode guarantees consistent output schemas, and function calling enables multi-page crawl orchestration from a single prompt.
Function calling lets extraction models plug directly into your data pipeline -- calling APIs, writing to databases, or triggering downstream transformations. Combined with JSON mode, this enables fully automated ETL workflows powered by AI.
Explore models by capability, compare pricing, or dive into the full leaderboard.
根据我们每小时更新的综合评分,本页顶部显示了排名靠前的模型。排名综合考虑了基准测试、定价、功能和社区采用情况。
是的,本页列出的几款模型提供免费套餐或完全开源。请查看上方定价列中标记为免费的模型。
我们使用综合评分系统,结合基准性能、功能匹配、定价、上下文窗口大小和社区采用情况。评分每小时更新一次。
排名每小时使用基准测试、API测试和社区指标的实时数据刷新。显示的数据始终反映最新的性能。