MiMo V2 Flash

Xiaomi's high-efficiency inference model with hybrid architecture, 3 MTP layers for 2.5-3.7x faster inference, and 256K context.

mimo-v2-flash
STABLEGet Started
Streaming
Tools
Reasoning
JSON Output

Select Provider

Xiaomi Pricing for MiMo V2 Flash

View detailed pricing and capabilities for this provider.

Xiaomi
Context: 256k
Input
$0.1
/M tokens
Cached
$0.01
/M tokens
Output
$0.3
/M tokens
Get Started