Browse and compare AI models across providers, modalities, and use cases.
Showing 4 of 4 models
Automatically generates text captions for your videos from the audio as per text colour/font specifications
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks