Browse and compare AI models across providers, modalities, and use cases.
Showing 20 of 62 models
ChatGPT-4o points to the GPT-4o snapshot currently used in ChatGPT. We recommend using an API model like GPT-5 or GPT-4o for most API integrations, but feel free to use this ChatGPT-4o model to test our latest improvements for chat use cases.
Pricing
Input: $5.00 / 1M tokens
Output: $15.00 / 1M tokens
Context
128.0K tokens
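The per-million-token prices in these listings can be turned into a per-request estimate with simple arithmetic. The following is a minimal sketch, not an official calculator: the function name `estimate_cost_usd` and its parameters are illustrative assumptions, and the token counts in the example are made up. Only the $5.00 / $15.00 prices come from the listing above.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_1m, output_price_per_1m):
    """Estimate the USD cost of one request from per-1M-token prices.

    Hypothetical helper: it only scales each token count by its price.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_1m
    output_cost = output_tokens / 1_000_000 * output_price_per_1m
    return input_cost + output_cost


# Example: 1,000 input tokens and 500 output tokens (assumed counts)
# at $5.00 input / $15.00 output per 1M tokens.
cost = estimate_cost_usd(1_000, 500, 5.00, 15.00)
print(f"${cost:.4f}")  # prints $0.0125
```

Real bills may differ from this estimate, for example when other fees or discounts apply, so treat the result as a rough guide.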
DALL·E is an AI system that creates realistic images and art from a natural language description. DALL·E 2 is older than DALL·E 3, but it offers more control in prompting and supports more requests at once.
DALL·E is an AI system that creates realistic images and art from a natural language description. Given a prompt, DALL·E 3 can currently create a new image at a specified size.
GPT Image 1 is our new state-of-the-art image generation model. It is a natively multimodal language model that accepts both text and image inputs, and produces image outputs.
Pricing
Input: $5.00 / 1M tokens
Output: $40.00 / 1M tokens
GPT-3.5 Turbo models can understand and generate natural language or code. They have been optimized for chat using the Chat Completions API but also work well for non-chat tasks. As of July 2024, use gpt-4o-mini in place of GPT-3.5 Turbo, as it is cheaper, more capable, multimodal, and just as fast. GPT-3.5 Turbo is still available for use in the API.
Pricing
Input: $0.50 / 1M tokens
Output: $1.50 / 1M tokens
Context
16.4K tokens
GPT-4 is an older version of a high-intelligence GPT model, usable in Chat Completions.
Pricing
Input: $30.00 / 1M tokens
Output: $60.00 / 1M tokens
Context
8.2K tokens
GPT-4 Turbo is the next generation of GPT-4, an older high-intelligence GPT model. It was designed to be a cheaper, better version of GPT-4. Today, we recommend using a newer model like GPT-4o.
Pricing
Input: $10.00 / 1M tokens
Output: $30.00 / 1M tokens
Context
128.0K tokens
This is a research preview of the GPT-4 Turbo model, an older high-intelligence GPT model.
Pricing
Input: $10.00 / 1M tokens
Output: $30.00 / 1M tokens
Context
128.0K tokens
GPT-4.1 excels at instruction following and tool calling, with broad knowledge across domains. It features a 1M-token context window and low latency, with no reasoning step.
Pricing
Input: $2.00 / 1M tokens
Output: $8.00 / 1M tokens
Context
1.0M tokens
GPT-4.1 mini excels at instruction following and tool calling. It features a 1M-token context window and low latency, with no reasoning step.
Pricing
Input: $0.40 / 1M tokens
Output: $1.60 / 1M tokens
Context
1.0M tokens
GPT-4.1 nano excels at instruction following and tool calling. It features a 1M-token context window and low latency, with no reasoning step.
Pricing
Input: $0.10 / 1M tokens
Output: $0.40 / 1M tokens
Context
1.0M tokens
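The three GPT-4.1 listings above share a context size and differ mainly in price, so it can help to compare them on the same workload. The sketch below is illustrative only: the `PRICES` table copies the listed per-1M-token prices, while the function name `workload_cost` and the workload size (2M input tokens, 500K output tokens) are assumptions chosen for the example.

```python
# USD per 1M tokens as (input, output), copied from the three listings above.
PRICES = {
    "GPT-4.1": (2.00, 8.00),
    "GPT-4.1 mini": (0.40, 1.60),
    "GPT-4.1 nano": (0.10, 0.40),
}


def workload_cost(input_tokens, output_tokens):
    """Return the estimated USD cost of the same workload on each model."""
    return {
        name: input_tokens / 1_000_000 * in_price
        + output_tokens / 1_000_000 * out_price
        for name, (in_price, out_price) in PRICES.items()
    }


# Assumed workload: 2M input tokens and 500K output tokens.
for name, usd in workload_cost(2_000_000, 500_000).items():
    print(f"{name}: ${usd:.2f}")
# GPT-4.1: $8.00
# GPT-4.1 mini: $1.60
# GPT-4.1 nano: $0.40
```

Price is only one axis of the comparison. The listings do not quantify quality differences between the three models, so this sketch says nothing about which one suits a given task.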
Deprecated - a research preview of GPT-4.5. We recommend using gpt-4.1 or o3 models instead for most use cases.
Pricing
Input: $75.00 / 1M tokens
Output: $150.00 / 1M tokens
Context
128.0K tokens
GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is the best model for most tasks, and is our most capable model outside of our o-series models.
Pricing
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Context
128.0K tokens
This is a preview release of the GPT-4o Audio models. These models accept audio inputs and produce audio outputs, and can be used in the Chat Completions REST API.
Pricing
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Context
128.0K tokens
This is a preview release of the GPT-4o Realtime model, capable of responding to audio and text inputs in realtime over WebRTC or a WebSocket interface.
Pricing
Input: $5.00 / 1M tokens
Output: $20.00 / 1M tokens
Context
32.0K tokens
GPT-4o Search Preview is a specialized model trained to understand and execute web search queries with the Chat Completions API. In addition to token fees, web search queries incur a fee per tool call. Learn more on the pricing page.
Pricing
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Context
128.0K tokens
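Because the search model adds a per-tool-call fee on top of token fees, a request's total has two parts. The sketch below shows only the shape of that sum. The token prices are the ones listed above, but the function name `search_request_cost`, the example token counts, and especially the `fee_per_call_usd` value are placeholders, since this listing does not state the actual per-call fee. Check the pricing page for the real figure.

```python
INPUT_PRICE_PER_1M = 2.50    # USD, from the listing above
OUTPUT_PRICE_PER_1M = 10.00  # USD, from the listing above


def search_request_cost(input_tokens, output_tokens, tool_calls, fee_per_call_usd):
    """Token fees plus a flat fee for each web search tool call (hypothetical helper)."""
    token_cost = (input_tokens / 1_000_000 * INPUT_PRICE_PER_1M
                  + output_tokens / 1_000_000 * OUTPUT_PRICE_PER_1M)
    return token_cost + tool_calls * fee_per_call_usd


# PLACEHOLDER fee: 0.01 is an invented value for illustration, not the real price.
total = search_request_cost(10_000, 1_000, tool_calls=2, fee_per_call_usd=0.01)
print(f"${total:.3f}")
```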
GPT-4o Transcribe is a speech-to-text model that uses GPT-4o to transcribe audio. Compared to the original Whisper models, it offers a lower word error rate and better language recognition and accuracy. Use it for more accurate transcripts.
Pricing
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Context
16.0K tokens
GPT-4o Transcribe Diarize is an automatic speech recognition (ASR) model with built-in speaker diarization, meaning it associates audio segments with different speakers in a conversation. This model is only available in the Transcription API.
Pricing
Input: $2.50 / 1M tokens
Output: $10.00 / 1M tokens
Context
16.0K tokens
GPT-4o mini (“o” for “omni”) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and outputs from a larger model like GPT-4o can be distilled to GPT-4o mini to produce similar results at lower cost and latency.
Pricing
Input: $0.15 / 1M tokens
Output: $0.60 / 1M tokens
Context
128.0K tokens
This is a preview release of the smaller GPT-4o Audio mini model. It is designed to accept audio inputs or produce audio outputs via the REST API.
Pricing
Input: $0.15 / 1M tokens
Output: $0.60 / 1M tokens
Context
128.0K tokens