Compare API pricing for 53 OpenAI models suited for coding: GPT-4o, o3, o1, GPT-4.1, and budget-friendly mini variants. See per-token costs, cost per coding request, context windows, and coding capabilities side by side.
Cost/Req = estimated cost per typical coding request (2,000 input + 1,000 output tokens). Prices via OpenRouter API.
GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs. Optimized for step-by-step reasoning, instruction following, and accuracy, GPT-5.4 Pro excels at agentic coding, long-context workflows, and multi-step problem solving.
GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for text and image inputs, enabling high-context reasoning, coding, and multimodal analysis within the same workflow. The model delivers improved performance in coding, document understanding, tool use, and instruction following. It is designed as a strong default for both general-purpose tasks and software engineering, capable of generating production-quality code, synthesizing information across multiple sources, and executing complex multi-step workflows with fewer iterations and greater token efficiency.
GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results on SWE-Bench Pro and strong performance on Terminal-Bench 2.0 and OSWorld-Verified, reflecting improved multi-language coding, terminal proficiency, and real-world computer-use skills. The model is optimized for long-running, tool-using workflows and supports interactive steering during execution, making it suitable for complex development tasks, debugging, deployment, and iterative product work. Beyond coding, GPT-5.3-Codex performs strongly on structured knowledge-work benchmarks such as GDPval, supporting tasks like document drafting, spreadsheet analysis, slide creation, and operational research across domains. It is trained with enhanced cybersecurity awareness, including vulnerability identification capabilities, and deployed with additional safeguards for high-risk use cases. Compared to prior Codex models, it is more token-efficient and approximately 25% faster, targeting professional end-to-end workflows that span reasoning, execution, and computer interaction.
GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks. The model supports building projects from scratch, feature development, debugging, large-scale refactoring, and code review. Compared to GPT-5.1-Codex, 5.2-Codex is more steerable, adheres closely to developer instructions, and produces cleaner, higher-quality code outputs. Reasoning effort can be adjusted with the `reasoning.effort` parameter; read the [docs here](https://openrouter.ai/docs/use-cases/reasoning-tokens#reasoning-effort-level). Codex integrates into developer environments including the CLI, IDE extensions, GitHub, and cloud tasks. It adapts reasoning effort dynamically, providing fast responses for small tasks while sustaining extended multi-hour runs for large projects. The model is trained to perform structured code reviews, catching critical flaws by reasoning over dependencies and validating behavior against tests. It also supports multimodal inputs such as images or screenshots for UI development and integrates tool use for search, dependency installation, and environment setup. Codex is intended specifically for agentic coding applications.
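A minimal sketch of setting reasoning effort on an OpenRouter chat completion request, using only the standard library. The payload shape follows the reasoning-tokens docs linked above; the model slug and prompt are illustrative assumptions, so verify both against the current docs before use.

```python
# Sketch: adjusting reasoning effort via the OpenRouter chat completions API.
# The "reasoning" field follows OpenRouter's reasoning-tokens documentation;
# the model slug below is an illustrative assumption, not a confirmed ID.
import json
import urllib.request

payload = {
    "model": "openai/gpt-5.2-codex",  # hypothetical slug for illustration
    "messages": [
        {"role": "user", "content": "Refactor this function for readability: ..."}
    ],
    # Effort levels: "low" | "medium" | "high" (see the linked docs)
    "reasoning": {"effort": "high"},
}

req = urllib.request.Request(
    "https://openrouter.ai/api/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={
        "Authorization": "Bearer <OPENROUTER_API_KEY>",  # placeholder
        "Content-Type": "application/json",
    },
)
# response = urllib.request.urlopen(req)  # requires a valid API key
```

Higher effort spends more reasoning tokens per request, so for small tasks "low" or "medium" keeps the per-request cost closer to the estimates in the table above.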
GPT-5.2 Pro is one of OpenAI’s most advanced models, offering major improvements in agentic coding and long-context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy in high-stakes use cases. It supports test-time routing features and advanced prompt understanding, including user-specified intent like "think hard about this." Improvements include reduced hallucination and sycophancy, along with better performance in coding, writing, and health-related tasks.
OpenAI coding models power many of the most popular AI development tools. Here is how they integrate with the leading coding assistants.
Uses GPT-4o and o3 as backend models for code completion, chat, and multi-file editing. OpenAI models are available via Cursor's Pro plan alongside Claude and other providers.
Built on OpenAI models including GPT-4o and o3-mini. Copilot uses these models for inline suggestions, chat, and code review. Enterprise plans offer access to the latest reasoning models.
Anthropic's CLI coding agent uses Claude models natively, but OpenAI models serve as a useful comparison point. Many developers switch between Claude Code and OpenAI-powered tools depending on the task.
Open-source AI pair programming tool with native support for all OpenAI models. Aider's benchmarks show GPT-4o and o3 performing strongly on code editing tasks. Supports function calling for precise file modifications.
Estimated cost per coding request based on 2,000 input tokens (prompt + code context) and 1,000 output tokens (generated code). Sorted from cheapest to most expensive.
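The per-request estimate above is a straightforward calculation from per-million-token prices. A small sketch, with placeholder prices rather than current rates (check the table for real figures):

```python
# Estimate the cost of a "typical" coding request: 2,000 input tokens
# (prompt + code context) and 1,000 output tokens (generated code).
# Prices are quoted per 1M tokens, as on OpenRouter.

def cost_per_request(input_price_per_m: float, output_price_per_m: float,
                     input_tokens: int = 2_000, output_tokens: int = 1_000) -> float:
    """Return the estimated USD cost for one coding request."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000

# Example with illustrative prices of $2.50/M input and $10.00/M output:
print(round(cost_per_request(2.50, 10.00), 4))  # 0.015
```

Because the estimate weights input tokens 2:1 over output tokens, models with cheap input but expensive output can still rank well for short completions.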
GPT-4o and GPT-4.1 are OpenAI's best general-purpose coding models. They support function calling, JSON mode, and multimodal input, making them ideal for code generation, debugging, refactoring, and code review. GPT-4.1 improves on instruction following and has better performance on complex coding tasks.
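JSON mode, mentioned above, is what makes these models easy to wire into automated review pipelines: the response is guaranteed to parse. A sketch of a JSON-mode request payload for the OpenAI-compatible chat completions schema; the `code_review`-style output shape and the example reply are illustrative assumptions, not real model output.

```python
# Sketch: requesting structured output via JSON mode (supported by GPT-4o
# and GPT-4.1). The review schema in the system prompt is an assumption
# for illustration; any JSON shape can be requested.
import json

payload = {
    "model": "gpt-4o",
    "response_format": {"type": "json_object"},  # forces valid-JSON output
    "messages": [
        {
            "role": "system",
            "content": (
                "Review the code and reply as JSON: "
                '{"issues": [...], "severity": "low|medium|high"}'
            ),
        },
        {"role": "user", "content": "def div(a, b): return a / b"},
    ],
}

# With JSON mode, the assistant's message content parses directly:
example_reply = '{"issues": ["no zero-division guard"], "severity": "medium"}'
review = json.loads(example_reply)
print(review["severity"])  # medium
```

Function calling works similarly: instead of `response_format`, the request carries a `tools` list of JSON-schema function definitions, and the model returns structured arguments for the tool it chooses.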
The o-series reasoning models use chain-of-thought to tackle complex algorithms, multi-step debugging, and architectural decisions. o3 is the latest and most capable, while o1 offers strong reasoning at a lower cost. These models are best for tasks requiring deep logical reasoning rather than simple code generation.
GPT-4o Mini offers coding capabilities at a fraction of the cost. While it scores lower on complex benchmarks, it handles straightforward code generation, autocompletion, and simple debugging efficiently. Ideal for high-volume use cases like CI/CD pipelines and automated code review.
The original Codex model (code-davinci-002) was deprecated in March 2023. All Codex capabilities have been absorbed into the GPT-4 family, which significantly outperforms the original Codex on every coding benchmark. Developers should use GPT-4o or GPT-4.1 as direct Codex replacements.
For most coding tasks, GPT-4o and GPT-4.1 offer the best balance of code quality and cost. For complex algorithmic or multi-step reasoning problems, o3 and o1 excel due to their chain-of-thought reasoning capabilities. GPT-4o Mini is ideal for high-volume code generation where cost efficiency is the priority.
The original OpenAI Codex model has been deprecated and replaced by the GPT-4 family. Current coding-capable models range from free (GPT-4o Mini in some tiers) to $60/M output tokens (o3-pro). A typical coding request (2,000 input + 1,000 output tokens) costs between $0.0003 and $0.06 depending on the model.
Yes. GPT-4o is one of the best models for coding. It supports function calling, JSON mode, and has strong performance on coding benchmarks like HumanEval and SWE-bench. It offers multimodal input (you can share screenshots of errors), a large context window, and competitive pricing for production coding workflows.
Both are excellent for coding. OpenAI's o3 and GPT-4.1 lead on certain benchmarks, while Anthropic's Claude 3.5 Sonnet and Claude 4 Opus excel in agentic coding tasks and long-context understanding. Claude tends to follow instructions more precisely, while OpenAI models often have broader tool ecosystem support. The best choice depends on your specific use case, budget, and tooling requirements.