opengvlab

OpenGVLab: InternVL3 14B

opengvlab/internvl3-14b

The 14b version of the InternVL3 series. An advanced multimodal large language model (MLLM) series that demonstrates superior overall performance. Compared to InternVL 2.5, InternVL3 exhibits superior multimodal perception and reasoning capabilities, while further extending its multimodal capabilities to encompass tool usage, GUI agents, industrial image analysis, 3D vision perception, and more.

Context window: 32,000 tokens
Input: image, text
Output: text

View on OpenRouter. Model data sourced from OpenRouter.