Models Directory

Browse and compare AI models across providers, modalities, and use cases.

Showing 20 of 1.0K models

Advanced Search

Active Filters

Out: Text

Ai Detector

AI Detector (Image) is an advanced service that analyzes a single picture and returns a verdict on whether it was likely created by AI.

View Details

Any LLM

Use any large language model from our selected catalogue (powered by OpenRouter)

View Details

Any VLM

Use any vision language model from our selected catalogue (powered by OpenRouter)

View Details

Arbiter

Semantic image alignment measurements

View Details

Arbiter

Image reference comparison measurements

View Details

Arbiter

Reference-free image measurements

View Details

ElevenLabs Speech to Text

Generate text from speech using ElevenLabs advanced speech-to-text model.

View Details

ElevenLabs Speech to Text - Scribe V2

Use Scribe-V2 from ElevenLabs to do blazingly fast speech to text inferences!

View Details

FFmpeg API Metadata

Get encoding metadata from video and audio files using FFmpeg API.

View Details

FFmpeg API Waveform

Get waveform data from audio files using FFmpeg API.

View Details

Ffmpeg Api

Get EBU R128 loudness normalization from audio files using FFmpeg API.

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.

View Details

Isaac 0.1

Isaac-01 is a multimodal vision-language model from Perceptron for various vision language tasks.

View Details

Isaac 0.1 [OpenAI Compatible Endpoint]

OpenAI spec compatible endpoint of Isaac-01 which is a multimodal vision-language model from Perceptron for various vision language tasks.

View Details

Showing 20 of 1.0K models

Advanced Search

Active Filters

Out: Text

Ai Detector

AI Detector (Image) is an advanced service that analyzes a single picture and returns a verdict on whether it was likely created by AI.

View Details

Any LLM

Use any large language model from our selected catalogue (powered by OpenRouter)

View Details

Any VLM

Use any vision language model from our selected catalogue (powered by OpenRouter)

View Details

Arbiter

Semantic image alignment measurements

View Details

Arbiter

Image reference comparison measurements

View Details

Arbiter

Reference-free image measurements

View Details

ElevenLabs Speech to Text

Generate text from speech using ElevenLabs advanced speech-to-text model.

View Details

ElevenLabs Speech to Text - Scribe V2

Use Scribe-V2 from ElevenLabs to do blazingly fast speech to text inferences!

View Details

FFmpeg API Metadata

Get encoding metadata from video and audio files using FFmpeg API.

View Details

FFmpeg API Waveform

Get waveform data from audio files using FFmpeg API.

View Details

Ffmpeg Api

Get EBU R128 loudness normalization from audio files using FFmpeg API.

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

Florence-2 Large

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

View Details

GOT OCR 2.0

View Details

Isaac 0.1

Isaac-01 is a multimodal vision-language model from Perceptron for various vision language tasks.

View Details

Isaac 0.1 [OpenAI Compatible Endpoint]

OpenAI spec compatible endpoint of Isaac-01 which is a multimodal vision-language model from Perceptron for various vision language tasks.

View Details

Models Directory

Advanced Search

Active Filters

Use Cases

ModalityOut: 1

License

Inference Medium

Provider

Languages

Context Length

Parameter Range

Input Price

Output Price

Ai Detector

Any LLM

Any VLM

Arbiter

Arbiter

Arbiter

ElevenLabs Speech to Text

ElevenLabs Speech to Text - Scribe V2

FFmpeg API Metadata

FFmpeg API Waveform

Ffmpeg Api

Florence-2 Large

Florence-2 Large

Florence-2 Large

Florence-2 Large

Florence-2 Large

Florence-2 Large

GOT OCR 2.0

Isaac 0.1

Isaac 0.1 [OpenAI Compatible Endpoint]

Advanced Search

Active Filters

Use Cases

ModalityOut: 1

License

Inference Medium

Provider

Languages

Context Length

Parameter Range

Input Price

Output Price

Ai Detector

Any LLM

Any VLM

Arbiter

Arbiter

Arbiter

ElevenLabs Speech to Text

ElevenLabs Speech to Text - Scribe V2

FFmpeg API Metadata

FFmpeg API Waveform

Ffmpeg Api

Florence-2 Large

Florence-2 Large

Florence-2 Large

Florence-2 Large

Florence-2 Large

Florence-2 Large

GOT OCR 2.0

Isaac 0.1

Isaac 0.1 [OpenAI Compatible Endpoint]