Browse and compare AI models across providers, modalities, and use cases.
AI Aging Generator performs controllable age progression or regression from a single face photo, generating lifelike portraits across eight age groups from baby to senior.
AI Detector (Image) is an advanced service that analyzes a single picture and returns a verdict on whether it was likely created by AI.
AI-FaceSwap-Video is a service that can replace a person's face throughout a video clip while keeping their movements natural.
AI-FaceSwap-Image is a service that can take one person's face and realistically blend it onto another's in a photo.
An audio understanding model that analyzes audio content and answers questions about what is happening in the audio, based on user prompts.
Video background removal version of the bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS).
Bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS).
Bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS).
Bria RMBG 2.0 enables seamless removal of backgrounds from images, ideal for professional editing tasks. Trained exclusively on licensed data for safe and risk-free commercial use. Model weights for commercial use are available here: https://share-eu1.hsforms.com/2GLpEVQqJTI2Lj7AMYwgfIwf4e04
Swap faces of one or two people at once, while preserving user and scene details!
Eye Correct is a video-to-video model that corrects eye direction in videos.
Generate high-quality images from depth maps using the Flux.1 [dev] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.
Generate high-quality images from depth maps using the Flux.1 [pro] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.
Generate high-quality images from depth maps using the Flux.1 [pro] depth estimation model with a fine-tuned LoRA. The model produces accurate depth representations for scene understanding and 3D visualization.
GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.