Models Directory

Browse and compare AI models across providers, modalities, and use cases.

Showing 20 of 24 models

Active filter: speech

Dia TTS

Clone dialog voices from an audio sample and generate dialog from text prompts using Dia TTS, which uses advanced AI techniques to produce high-quality text-to-speech output.

ElevenLabs Speech to Text

Generate text from speech using ElevenLabs' advanced speech-to-text model.

F5 TTS

Kokoro TTS

Kokoro is a lightweight text-to-speech model that delivers quality comparable to larger models while being significantly faster and more cost-efficient.

Kokoro TTS (Brazilian Portuguese)

A natural and expressive Brazilian Portuguese text-to-speech model optimized for clarity and fluency.

Kokoro TTS (British English)

A high-quality British English text-to-speech model offering natural and expressive voice synthesis.

Kokoro TTS (French)

An expressive and natural French text-to-speech model for both European and Canadian French.

Kokoro TTS (Hindi)

A fast and expressive Hindi text-to-speech model with clear pronunciation and accurate intonation.

Kokoro TTS (Italian)

A high-quality Italian text-to-speech model delivering smooth and expressive speech synthesis.

Kokoro TTS (Japanese)

A fast and natural-sounding Japanese text-to-speech model optimized for smooth pronunciation.

Kokoro TTS (Mandarin Chinese)

A highly efficient Mandarin Chinese text-to-speech model that captures natural tones and prosody.

Kokoro TTS (Spanish)

A natural-sounding Spanish text-to-speech model optimized for Latin American and European Spanish.

MiniMax Speech-02 HD

Generate speech from text prompts in a choice of voices using the MiniMax Speech-02 HD model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax Speech-02 HD

Generate speech from text prompts in a choice of voices using the MiniMax Speech-02 HD model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax Speech-02 Turbo

Quickly generate speech from text prompts in a choice of voices using the MiniMax Speech-02 Turbo model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax Speech-02 Turbo

Quickly generate speech from text prompts in a choice of voices using the MiniMax Speech-02 Turbo model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax Voice Cloning

Clone a voice from an audio sample and generate speech from text prompts using the MiniMax model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax Voice Cloning

Clone a voice from an audio sample and generate speech from text prompts using the MiniMax model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax Voice Design

Design a personalized voice from a text description and generate speech from text prompts using the MiniMax model, which uses advanced AI techniques to produce high-quality text-to-speech output.

MiniMax

Generate speech from text prompts in a choice of voices using the MiniMax Speech-02 HD model, which uses advanced AI techniques to produce high-quality text-to-speech output.

