All models

DeepSeek: DeepSeek V4 Flash

Premium
deepseek/deepseek-v4-flash

About this model

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. Released April 24, 2026. Designed for fast inference and high-throughput workloads while maintaining strong reasoning and coding performance. Includes hybrid attention for efficient long-context processing. Reasoning efforts high and xhigh are supported; xhigh maps to max reasoning. Well suited for coding assistants, chat systems, and agent workflows. Pricing: $0.14/M input, $0.28/M output.