Browse and compare AI models across providers, modalities, and use cases.
Showing 20 of 21 models
Aya Expanse is a highly performant 32B multilingual model, designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages.
Context
131.1K
Aya Expanse is a highly performant 8B multilingual model, designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages.
Context
8.2K
Aya Vision is a state-of-the-art multimodal model excelling at a variety of critical benchmarks for language, text, and image capabilities. Serves 23 languages. This 32 billion parameter variant is focused on state-of-art multilingual performance.
Context
16.4K
Aya Vision is a state-of-the-art multimodal model excelling at a variety of critical benchmarks for language, text, and image capabilities. This 8 billion parameter variant is focused on low latency and best-in-class performance.
Context
16.4K
An instruction-following conversational model that performs language tasks with high quality, more reliably and with a longer context than our base generative models.
Context
4.1K
Command A is our most performant model to date, excelling at tool use, agents, retrieval augmented generation (RAG), and multilingual use cases. Command A has a context length of 256K, only requires two GPUs to run, and has 150% higher throughput compared to Command R+ 08-2024.
Pricing
Input: $2.50 / 1M tokensOutput: $10.00 / 1M tokens
Context
262.1K
A smaller, faster version of command. Almost as capable, but a lot faster.
Context
4.1K
To reduce the time between major releases, we put out nightly versions of command models. For command-light, that is command-light-nightly. Be advised that command-light-nightly is the latest, most experimental, and (possibly) unstable version of its default counterpart. Nightly releases are updated regularly, without warning, and are not recommended for production use.
Context
4.1K
To reduce the time between major releases, we put out nightly versions of command models. For command, that is command-nightly. Be advised that command-nightly is the latest, most experimental, and (possibly) unstable version of its default counterpart. Nightly releases are updated regularly, without warning, and are not recommended for production use.
Context
131.1K
Command R is an instruction-following conversational model that performs language tasks at a higher quality, more reliably, and with a longer context than previous models. It can be used for complex workflows like code generation, retrieval augmented generation (RAG), tool use, and agents.
Pricing
Input: $0.30 / 1M tokensOutput: $1.20 / 1M tokens
Context
131.1K
command-r-08-2024 is an update of the Command R model, delivered in August 2024. Find more information here
Pricing
Input: $0.30 / 1M tokensOutput: $1.20 / 1M tokens
Context
131.1K
Command R+ is an instruction-following conversational model that performs language tasks at a higher quality, more reliably, and with a longer context than previous models. It is best suited for complex RAG workflows and multi-step tool use.
Pricing
Input: $2.50 / 1M tokensOutput: $10.00 / 1M tokens
Context
131.1K
command-r7b-12-2024 is a small, fast update delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning and multiple steps.
Pricing
Input: $0.04 / 1M tokensOutput: $0.15 / 1M tokens
Context
131.1K
A smaller, faster version of embed-english-v3.0. Almost as capable, but a lot faster. English only.
Context
512
A model that allows for text to be classified or turned into embeddings. English only.
Context
512
A smaller, faster version of embed-multilingual-v3.0. Almost as capable, but a lot faster. Supports multiple languages.
Context
512
Provides multilingual classification and embedding support. See supported languages here.
Context
512
A model that allows for text and images to be classified or turned into embeddings
Pricing
Input: $0.12 / 1M tokens
Context
131.1K
A model that allows for re-ranking English Language documents and semi-structured data (JSON). This model has a context length of 4096 tokens.
Context
4.1K
A model for documents and semi-structure data (JSON) that are not in English. Supports the same languages as embed-multilingual-v3.0. This model has a context length of 4096 tokens.
Context
4.1K