by Fal • Released Jan 13, 2025
Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels
AI Model Provider