GPT-5.4
OpenAI's most capable model for complex reasoning and coding. 1.1M context, multimodal.
All tracked OpenAI models — pricing, context windows, lifecycle state, and release history in one place.
OpenAI's most capable model for complex reasoning and coding. 1.1M context, multimodal.
Highest-tier reasoning model with deep thinking. 1.1M context.
Smaller 5.4 variant for coding, computer use, and subagents. 1.1M context.
Cheapest model in the GPT-5.4 family for high-volume simple tasks.
Chat-optimized variant of the GPT-5.3 series.
OpenAI's latest coding-focused model. 91.5% LiveCodeBench. 400K context.
OpenAI's reasoning model. Excels at math, science, and complex multi-step problems. 85.3% GPQA.
Latest mini reasoning model. 83.2% GPQA, 85.9% coding benchmarks.
Previous-generation GPT-5 flagship. 400K context.
Original GPT-5 frontier model. 400K context.
GPT-4.1 model with 1M context window. Good instruction following.
Smaller GPT-4.1 model with 1M context. Cost-efficient.
Smaller reasoning model. Good balance of reasoning capability and cost. 79.1% GPQA.
OpenAI's first reasoning model. Still available but superseded by o3.
Smaller GPT-4o variant. Very cost-effective for production use.
OpenAI's former flagship. Multimodal (text + image). 128K context.
GPT-4 Turbo with 128K context and vision. Superseded by GPT-4o.
Original GPT-4 model. 8K context. Still available but expensive for its capability.
Predecessor → successor chains tracked for OpenAI models.
Smaller 5.4 variant for coding, computer use, and subagents. 1.1M context.
Cheapest model in the GPT-5.4 family for high-volume simple tasks.
Highest-tier reasoning model with deep thinking. 1.1M context.
OpenAI's most capable model for complex reasoning and coding. 1.1M context, multimodal.
Chat-optimized variant of the GPT-5.3 series.
OpenAI's latest coding-focused model. 91.5% LiveCodeBench. 400K context.
Previous-generation GPT-5 flagship. 400K context.
Original GPT-5 frontier model. 400K context.
OpenAI's reasoning model. Excels at math, science, and complex multi-step problems. 85.3% GPQA.
Latest mini reasoning model. 83.2% GPQA, 85.9% coding benchmarks.
Smaller GPT-4.1 model with 1M context. Cost-efficient.
GPT-4.1 model with 1M context window. Good instruction following.
Smaller reasoning model. Good balance of reasoning capability and cost. 79.1% GPQA.
OpenAI's first reasoning model. Still available but superseded by o3.
Smaller GPT-4o variant. Very cost-effective for production use.
OpenAI's former flagship. Multimodal (text + image). 128K context.
GPT-4 Turbo with 128K context and vision. Superseded by GPT-4o.
Original GPT-4 model. 8K context. Still available but expensive for its capability.