by Fal • Released Jan 9, 2025
MoonDreamNext is a multimodal vision-language model for captioning, gaze detection, bbox detection, point detection, and more.
AI Model Provider