Qwen

Qwen: Qwen VL Plus

qwen/qwen-vl-plus

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.

Context window: 7,500 tokens
Input: text, image
Output: text

View on OpenRouter. Model data sourced from OpenRouter.