Browse and compare AI models across providers, modalities, and use cases.
Showing 16 of 16 models
Generate text from speech using ElevenLabs advanced speech-to-text model.
Kokoro is a lightweight text-to-speech model that delivers comparable quality to larger models while being significantly faster and more cost-efficient.
A natural and expressive Brazilian Portuguese text-to-speech model optimized for clarity and fluency.
A high-quality British English text-to-speech model offering natural and expressive voice synthesis.
An expressive and natural French text-to-speech model for both European and Canadian French.
A fast and expressive Hindi text-to-speech model with clear pronunciation and accurate intonation.
A high-quality Italian text-to-speech model delivering smooth and expressive speech synthesis.
A fast and natural-sounding Japanese text-to-speech model optimized for smooth pronunciation.
A highly efficient Mandarin Chinese text-to-speech model that captures natural tones and prosody.
A natural-sounding Spanish text-to-speech model optimized for Latin American and European Spanish.
Generate speech from text prompts and different voices using the MiniMax Speech-02 HD model, which leverages advanced AI techniques to create high-quality text-to-speech.
Generate fast speech from text prompts and different voices using the MiniMax Speech-02 Turbo model, which leverages advanced AI techniques to create high-quality text-to-speech.
Clone a voice from a sample audio and generate speech from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality text-to-speech.
[Experimental] Whisper v3 Large -- but optimized by our inference wizards. Same WER, double the performance!