ibm-granite

IBM: Granite 4.1 8B

ibm-granite/granite-4.1-8b

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks including tool calling, retrieval-augmented generation (RAG), code generation with fill-in-the-middle support, text summarization, classification, and extraction. The model handles 12 languages (English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese) and implements OpenAI-compatible tool calling. Released under the Apache 2.0 license.

  • Context window: 131,072 tokens
  • Input: text
  • Output: text
  • Pricing: $0.05/M input tokens, $0.1/M output tokens

View on OpenRouter. Model data sourced from OpenRouter.