Xiaomi's full-modal perception model supporting native understanding of images, videos, audio, and text with 1M context. Agent performance comparable to MiMo V2.5 Pro.
mimo-v2.5
View detailed pricing and capabilities for this provider.
AI-powered help
Please introduce yourself before we start.