DeepSeek
DeepSeek: DeepSeek V4 Flash
deepseek/deepseek-v4-flash
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and high-throughput workloads, while maintaining strong reasoning and coding performance. The model includes hybrid attention for efficient long-context processing. Reasoning efforts `high` and `xhigh` are supported; `xhigh` maps to max reasoning. It is well suited for applications such as coding assistants, chat systems, and agent workflows where responsiveness and cost efficiency are important.
- Context window: 1,048,576 tokens
- Input: text
- Output: text
- Pricing: $0.0983/M input tokens, $0.1966/M output tokens
View on OpenRouter. Model data sourced from OpenRouter.