Browse and compare AI models across providers, modalities, and use cases.
Showing 20 of 23 models
Gemini 1.5 Flash is a fast and versatile multimodal model for scaling across diverse tasks.
Pricing
Input: $0.07 / 1M tokensOutput: $0.30 / 1M tokens
Context
1.0M
Gemini 1.5 Flash-8B is a small model designed for lower intelligence tasks.
Pricing
Input: $0.04 / 1M tokensOutput: $0.15 / 1M tokens
Context
1.0M
Try Gemini 2.5 Pro Preview, our most advanced Gemini model to date.
Pricing
Input: $1.25 / 1M tokensOutput: $5.00 / 1M tokens
Context
2.1M
Gemini 2.0 Flash is deprecated and will be shut down on March 31, 2026.
Pricing
Input: $0.10 / 1M tokensOutput: $0.40 / 1M tokens
Context
1.0M
The Gemini 2.0 Flash Live model works with the Live API to enable low-latency bidirectional voice and video interactions with Gemini. The model can process text, audio, and video input, and it can provide text and audio output.
Pricing
Input: $0.10 / 1M tokensOutput: $0.40 / 1M tokens
Context
1.0M
Gemini 2.0 Flash-Lite is deprecated and will be shut down on March 31, 2026.
Pricing
Input: $0.07 / 1M tokensOutput: $0.30 / 1M tokens
Context
1.0M
Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Pricing
Input: $0.30 / 1M tokensOutput: $2.50 / 1M tokens
Context
1.0M
Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Pricing
Input: $0.30 / 1M tokensOutput: $0.04 / 1M tokens
Context
131.1K
Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Pricing
Input: $0.50 / 1M tokensOutput: $10.00 / 1M tokens
Context
8.2K
Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Pricing
Input: $0.30 / 1M tokensOutput: $0.04 / 1M tokens
Context
65.5K
Our best model in terms of price-performance, offering well-rounded capabilities. 2.5 Flash is best for large scale processing, low-latency, high volume tasks that require thinking, and agentic use cases.
Pricing
Input: $0.30 / 1M tokensOutput: $2.50 / 1M tokens
Context
1.0M
Our best model in terms of price-performance, offering well-rounded capabilities. Gemini 2.5 Flash rate limits are more restricted since it is an experimental / preview model.
Pricing
Input: $0.15 / 1M tokensOutput: $0.60 / 1M tokens
Context
1.0M
Our fastest flash model optimized for cost-efficiency and high throughput.
Pricing
Input: $0.10 / 1M tokensOutput: $0.40 / 1M tokens
Context
1.0M
Our fastest flash model optimized for cost-efficiency and high throughput.
Pricing
Input: $0.10 / 1M tokensOutput: $0.40 / 1M tokens
Context
1.0M
Our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.
Pricing
Input: $1.00 / 1M tokensOutput: $20.00 / 1M tokens
Context
8.2K
Our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context.
Pricing
Input: $1.25 / 1M tokensOutput: $10.00 / 1M tokens
Context
1.0M
Gemini 2.5 Pro is our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context. Gemini 2.5 Pro rate limits are more restricted since it is an experimental / preview model.
Pricing
Input: $1.25 / 1M tokensOutput: $10.00 / 1M tokens
Context
1.0M
Our most balanced model built for speed, scale, and frontier intelligence.
Pricing
Input: $0.50 / 1M tokensOutput: $3.00 / 1M tokens
Context
1.0M
The best model in the world for multimodal understanding, and our most powerful agentic and vibe-coding model yet, delivering richer visuals and deeper interactivity, all built on a foundation of state-of-the-art reasoning.
Pricing
Input: $0.00 / 1M tokensOutput: $0.13 / 1M tokens
Context
65.5K
The best model in the world for multimodal understanding, and our most powerful agentic and vibe-coding model yet, delivering richer visuals and deeper interactivity, all built on a foundation of state-of-the-art reasoning.
Pricing
Input: $2.00 / 1M tokensOutput: $12.00 / 1M tokens
Context
1.0M