MiMo V2 Omni

Xiaomi's multimodal model supporting text, vision, and speech modalities with 256K context window.

mimo-v2-omni
STABLEGet Started
Streaming
Vision
Tools
JSON Output

Select Provider

Xiaomi Pricing for MiMo V2 Omni

View detailed pricing and capabilities for this provider.

Xiaomi
Context: 256k
Input
$0.4
/M tokens
Cached
$0.08
/M tokens
Output
$2
/M tokens
Get Started