thudm
THUDM: GLM 4.1V 9B Thinking
thudm/glm-4.1v-9b-thinking
GLM-4.1V-9B-Thinking is a 9B parameter vision-language model developed by THUDM, based on the GLM-4-9B foundation. It introduces a reasoning-centric "thinking paradigm" enhanced with reinforcement learning to improve multimodal reasoning, long-context understanding (up to 64K tokens), and complex problem solving. It achieves state-of-the-art performance among models in its class, outperforming even larger models like Qwen-2.5-VL-72B on a majority of benchmark tasks.
- Context window: 65,536 tokens
- Input: image, text
- Output: text
View on OpenRouter. Model data sourced from OpenRouter.