Z.ai

Z.ai: GLM 5V Turbo

z-ai/glm-5v-turbo

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding, and task execution, and works seamlessly with agents to complete the full loop of “perceive → plan → execute“.

  • Context window: 202,752 tokens
  • Input: image, text, video
  • Output: text
  • Pricing: $1.2/M input tokens, $4/M output tokens

View on OpenRouter. Model data sourced from OpenRouter.