Browse and compare AI models across providers, modalities, and use cases.
OmniHuman v1.5 is an improved version of OmniHuman. It generates video from an image of a human figure paired with an audio file. It produces vivid, high-quality videos in which the character’s emotions and movements closely track the audio.
Generate high-fidelity, studio-quality videos of your avatar speaking or singing using Aurora from the Creatify team!
VEED Fabric 1.0 is an image-to-video API that turns any image into a talking video.
Kling LipSync is an audio-to-video model that generates realistic lip movements from audio input.
Kling LipSync is a text-to-video model that generates realistic lip movements from text input.
OmniHuman generates video using an image of a human figure paired with an audio file. It produces vivid, high-quality videos where the character’s emotions and movements maintain a strong correlation with the audio.
Use React-1 from SyncLabs to refine human emotions and produce realistic lip-sync without losing detail!