Qwen
Qwen: Qwen3.6 35B A3B
qwen/qwen3.6-35b-a3b
Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated DeltaNet linear attention with standard gated attention layers, enabling efficient inference at a fraction of the compute cost. The model supports a 262K token native context window (extensible to 1M via YaRN) and accepts text, image, and video inputs. It includes integrated thinking mode with reasoning traces preserved across multi-turn conversations, function calling, and structured output. Released under the Apache 2.0 license.
- Context window: 262,144 tokens
- Input: text, image, video
- Output: text
- Pricing: $0.14/M input tokens, $1/M output tokens
View on OpenRouter. Model data sourced from OpenRouter.