Qwen
Qwen: Qwen VL Plus
qwen/qwen-vl-plus
Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
- Context window: 7,500 tokens
- Input: text, image
- Output: text
View on OpenRouter. Model data sourced from OpenRouter.