DeepInfra Provider

DeepInfra inference platform with OpenAI-compatible API for hosting open-source models.

Available Models

DeepSeek V4 Pro

deepseek
deepseek-v4-pro
Streaming
Tools
Reasoning
JSON Output
DeepInfra
Context: 64k
Input
$1.74
/M tokens
Cached
$0.145
/M tokens
Output
$3.48
/M tokens

DeepSeek V4 Flash

deepseek
deepseek-v4-flash
Streaming
Tools
Reasoning
JSON Output
DeepInfra
Context: 1M
Input
$0.14
/M tokens
Cached
$0.028
/M tokens
Output
$0.28
/M tokens

GLM-5.1

glm
glm-5.1
Streaming
Tools
Reasoning
JSON Output
DeepInfra
Context: 198k
Input
$1.05
/M tokens
Cached
$0.205
/M tokens
Output
$3.5
/M tokens

Kimi K2.5

moonshot
kimi-k2.5
Streaming
Vision
Tools
Reasoning
JSON Output
DeepInfra
Context: 256k
Input
$0.45
/M tokens
Cached
$0.07
/M tokens
Output
$2.25
/M tokens

DeepSeek V3.2

deepseek
deepseek-v3.2
Streaming
Tools
JSON Output
DeepInfra
Context: 160k
Input
$0.26
/M tokens
Cached
$0.13
/M tokens
Output
$0.38
/M tokens