opengvlab
OpenGVLab: InternVL3 14B
opengvlab/internvl3-14b
The 14b version of the InternVL3 series. An advanced multimodal large language model (MLLM) series that demonstrates superior overall performance. Compared to InternVL 2.5, InternVL3 exhibits superior multimodal perception and reasoning capabilities, while further extending its multimodal capabilities to encompass tool usage, GUI agents, industrial image analysis, 3D vision perception, and more.
- Context window: 32,000 tokens
- Input: image, text
- Output: text
View on OpenRouter. Model data sourced from OpenRouter.