fal-ai

Sa2VA 8B Image

fal/sa2va

Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels

  • Input: text
  • Output: text

View on OpenRouter. Model data sourced from OpenRouter.