Google

Google: Gemma 3 12B

google/gemma-3-12b-it

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Context window: 131,072 tokens
Input: text, image
Output: text
Pricing: $0.05/M input tokens, $0.15/M output tokens

View on OpenRouter. Model data sourced from OpenRouter.