Browse and compare AI models across providers, modalities, and use cases.
Showing 20 of 42 models
Stable Diffusion 3.5 Large is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
Produce high-quality images with minimal inference steps.
Lumina-Image-2.0 is a 2 billion parameter flow-based diffusion transforer which features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.
Produce high-quality images with minimal inference steps. Optimized for 512x512 input image size.
Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation