DeepInfra Provider
DeepInfra inference platform with OpenAI-compatible API for hosting open-source models.
Available Models
DeepSeek V4 Pro
deepseek
deepseek-v4-proStreaming
Tools
Reasoning
JSON Output
DeepInfra
Context: 64k
Input
$1.74
/M tokens
Cached
$0.145
/M tokens
Output
$3.48
/M tokens
DeepSeek V4 Flash
deepseek
deepseek-v4-flashStreaming
Tools
Reasoning
JSON Output
DeepInfra
Context: 1M
Input
$0.14
/M tokens
Cached
$0.028
/M tokens
Output
$0.28
/M tokens
GLM-5.1
glm
glm-5.1Streaming
Tools
Reasoning
JSON Output
DeepInfra
Context: 198k
Input
$1.05
/M tokens
Cached
$0.205
/M tokens
Output
$3.5
/M tokens
Kimi K2.5
moonshot
kimi-k2.5Streaming
Vision
Tools
Reasoning
JSON Output
DeepInfra
Context: 256k
Input
$0.45
/M tokens
Cached
$0.07
/M tokens
Output
$2.25
/M tokens
DeepSeek V3.2
deepseek
deepseek-v3.2Streaming
Tools
JSON Output
DeepInfra
Context: 160k
Input
$0.26
/M tokens
Cached
$0.13
/M tokens
Output
$0.38
/M tokens