Browse and compare AI models across providers, modalities, and use cases.
Showing 11 of 11 models
Gemini 1.5 Flash is a fast and versatile multimodal model for scaling across diverse tasks.
Pricing
Input: $0.07 / 1M tokensOutput: $0.30 / 1M tokens
Context
1.0M
Gemini 1.5 Flash-8B is a small model designed for lower intelligence tasks.
Pricing
Input: $0.04 / 1M tokensOutput: $0.15 / 1M tokens
Context
1.0M
Try Gemini 2.5 Pro Preview, our most advanced Gemini model to date.
Pricing
Input: $1.25 / 1M tokensOutput: $5.00 / 1M tokens
Context
2.1M
Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, native tool use, multimodal generation, and a 1M token context window.
Pricing
Input: $0.10 / 1M tokensOutput: $0.40 / 1M tokens
Context
1.0M
The Gemini 2.0 Flash Live model works with the Live API to enable low-latency bidirectional voice and video interactions with Gemini. The model can process text, audio, and video input, and it can provide text and audio output.
Pricing
Input: $0.10 / 1M tokensOutput: $0.40 / 1M tokens
Context
1.0M
A Gemini 2.0 Flash model optimized for cost efficiency and low latency.
Pricing
Input: $0.07 / 1M tokensOutput: $0.30 / 1M tokens
Context
1.0M
Our best model in terms of price-performance, offering well-rounded capabilities. Gemini 2.5 Flash rate limits are more restricted since it is an experimental / preview model.
Pricing
Input: $0.15 / 1M tokensOutput: $0.60 / 1M tokens
Context
1.0M
Gemini 2.5 Pro is our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context. Gemini 2.5 Pro rate limits are more restricted since it is an experimental / preview model.
Pricing
Input: $1.25 / 1M tokensOutput: $10.00 / 1M tokens
Context
1.0M
Gemini embedding achieves a SOTA performance across many key dimensions including code, multi-lingual, and retrieval. Gemini Embedding rate limits are more restricted since it is an experimental model.
Context
8.2K
Imagen 3 is our highest quality text-to-image model, capable of generating images with even better detail, richer lighting and fewer distracting artifacts than our previous models.
Veo 2 is our high quality text- and image-to-video model, capable of generating detailed videos, capturing the artistic nuance in your prompts.