MiMo V2.5

Xiaomi's full-modal perception model supporting native understanding of images, videos, audio, and text with 1M context. Agent performance comparable to MiMo V2.5 Pro.

mimo-v2.5
STABLEGet Started
Streaming
Vision
Tools
Reasoning
JSON Output

Select Provider

Xiaomi Pricing for MiMo V2.5

View detailed pricing and capabilities for this provider.

Xiaomi
Context: 1M
Input
$0.4
/M tokens
Cached
$0.08
/M tokens
Output
$2
/M tokens
Get Started