fal-ai

MoonDreamNext

fal/moondreamnext

MoonDreamNext is a multimodal vision-language model for image captioning, gaze detection, bounding-box detection, point detection, and more.

  • Input: text, image
  • Output: text, image

Model data sourced from OpenRouter.