newai.today
HomeModelsBenchmarks
newai.today

Discover, compare and track AI models, their public releases and benchmark scores.

Resources

  • Models Directory
  • Benchmarks
  • API Documentation

Company

  • About
  • Blog
  • Contact

Legal

  • Privacy Policy
  • Terms of Service

© 2025 newai.today. All rights reserved.

Theme:

Models Directory

Browse and compare AI models across providers, modalities, and use cases.

Showing 16 of 16 models

Advanced Search

Active Filters

speech

ElevenLabs Speech to Text

Generate text from speech using ElevenLabs advanced speech-to-text model.

View Details

F5 TTS

F5 TTS

View Details

Kokoro TTS

Kokoro is a lightweight text-to-speech model that delivers comparable quality to larger models while being significantly faster and more cost-efficient.

View Details

Kokoro TTS (Brazilian Portuguese)

A natural and expressive Brazilian Portuguese text-to-speech model optimized for clarity and fluency.

View Details

Kokoro TTS (British English)

A high-quality British English text-to-speech model offering natural and expressive voice synthesis.

View Details

Kokoro TTS (French)

An expressive and natural French text-to-speech model for both European and Canadian French.

View Details

Kokoro TTS (Hindi)

A fast and expressive Hindi text-to-speech model with clear pronunciation and accurate intonation.

View Details

Kokoro TTS (Italian)

A high-quality Italian text-to-speech model delivering smooth and expressive speech synthesis.

View Details

Kokoro TTS (Japanese)

A fast and natural-sounding Japanese text-to-speech model optimized for smooth pronunciation.

View Details

Kokoro TTS (Mandarin Chinese)

A highly efficient Mandarin Chinese text-to-speech model that captures natural tones and prosody.

View Details

Kokoro TTS (Spanish)

A natural-sounding Spanish text-to-speech model optimized for Latin American and European Spanish.

View Details

MiniMax Speech-02 HD

Generate speech from text prompts and different voices using the MiniMax Speech-02 HD model, which leverages advanced AI techniques to create high-quality text-to-speech.

View Details

MiniMax Speech-02 Turbo

Generate fast speech from text prompts and different voices using the MiniMax Speech-02 Turbo model, which leverages advanced AI techniques to create high-quality text-to-speech.

View Details

MiniMax Voice Cloning

Clone a voice from a sample audio and generate speech from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality text-to-speech.

View Details

Whisper

Whisper is a model for speech transcription and translation.

View Details

Wizper (Whisper v3 -- fal.ai edition)

[Experimental] Whisper v3 Large -- but optimized by our inference wizards. Same WER, double the performance!

View Details