newai.today
HomeModelsBenchmarks
newai.today

Discover, compare and track AI models, their public releases and benchmark scores.

Resources

  • Models Directory
  • Benchmarks
  • API Documentation

Company

  • About
  • Blog
  • Contact

Legal

  • Privacy Policy
  • Terms of Service

© 2025 newai.today. All rights reserved.

Theme:

Models Directory

Browse and compare AI models across providers, modalities, and use cases.

Showing 20 of 61 models

Advanced Search

Active Filters

Out: Audio

PlayAI Text-to-Speech Dialog

Generate natural-sounding multi-speaker dialogues, and audio. Perfect for expressive outputs, storytelling, games, animations, and interactive media.

View Details

music generator

CassetteAI’s model generates a 30-second sample in under 2 seconds and a full 3-minute track in under 10 seconds. At 44.1 kHz stereo audio, expect a level of professional consistency with no breaks, no squeaks, and no random interruptions in your creations.

View Details

Aurora

To specify which model you want to use, set the model parameter in your API requests:

View Details

Blizzard

Highly conversational output with natural pacing and intonation; Better handling of filler words and casual speech; Instant voice cloning that better preserves accents and speaker styles

View Details

CSM-1B

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs.

View Details

DiffRhythm: Lyrics to Song

DiffRhythm is a blazing fast model for transforming lyrics into full songs. It boasts the capability to generate full songs in less than 30 seconds.

View Details

ElevenLabs Audio Isolation

Isolate audio tracks using ElevenLabs advanced audio isolation technology.

View Details

ElevenLabs Sound Effects

Generate sound effects using ElevenLabs advanced sound effects model.

View Details

ElevenLabs TTS Multilingual v2

Generate multilingual text-to-speech audio using ElevenLabs TTS Multilingual v2.

View Details

ElevenLabs TTS Turbo v2.5

Generate high-speed text-to-speech audio using ElevenLabs TTS Turbo v2.5.

View Details

F5 TTS

F5 TTS

View Details

Kokoro TTS

Kokoro is a lightweight text-to-speech model that delivers comparable quality to larger models while being significantly faster and more cost-efficient.

View Details

Kokoro TTS (Brazilian Portuguese)

A natural and expressive Brazilian Portuguese text-to-speech model optimized for clarity and fluency.

View Details

Kokoro TTS (British English)

A high-quality British English text-to-speech model offering natural and expressive voice synthesis.

View Details

Kokoro TTS (French)

An expressive and natural French text-to-speech model for both European and Canadian French.

View Details

Kokoro TTS (Hindi)

A fast and expressive Hindi text-to-speech model with clear pronunciation and accurate intonation.

View Details

Kokoro TTS (Italian)

A high-quality Italian text-to-speech model delivering smooth and expressive speech synthesis.

View Details

Kokoro TTS (Japanese)

A fast and natural-sounding Japanese text-to-speech model optimized for smooth pronunciation.

View Details

Kokoro TTS (Mandarin Chinese)

A highly efficient Mandarin Chinese text-to-speech model that captures natural tones and prosody.

View Details

Kokoro TTS (Spanish)

A natural-sounding Spanish text-to-speech model optimized for Latin American and European Spanish.

View Details
  • Previous
  • 1
  • 2
  • 4
  • Next