thudm

THUDM: GLM 4.1V 9B Thinking

thudm/glm-4.1v-9b-thinking

GLM-4.1V-9B-Thinking is a 9B parameter vision-language model developed by THUDM, based on the GLM-4-9B foundation. It introduces a reasoning-centric "thinking paradigm" enhanced with reinforcement learning to improve multimodal reasoning, long-context understanding (up to 64K tokens), and complex problem solving. It achieves state-of-the-art performance among models in its class, outperforming even larger models like Qwen-2.5-VL-72B on a majority of benchmark tasks.

Context window: 65,536 tokens
Input: image, text
Output: text

View on OpenRouter. Model data sourced from OpenRouter.