Browse and compare AI models across providers, modalities, and use cases.
Showing 20 of 1.0K models
AI Detector (Image) is an advanced service that analyzes a single picture and returns a verdict on whether it was likely created by AI.
Use any vision language model from our selected catalogue (powered by OpenRouter)
Generate text from speech using ElevenLabs advanced speech-to-text model.
Use Scribe-V2 from ElevenLabs to do blazingly fast speech to text inferences!
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.
Isaac-01 is a multimodal vision-language model from Perceptron for various vision language tasks.
OpenAI spec compatible endpoint of Isaac-01 which is a multimodal vision-language model from Perceptron for various vision language tasks.