Nebius AI Provider
Nebius AI Studio - OpenAI-compatible API for large language models
Available Models
llama-3.1-8b-instruct
meta-llama/Meta-Llama-3.1-8B-Instruct
nebius/llama-3.1-8b-instruct
Context: 128k
$0.02 in/$0.06 out
llama-3.1-nemotron-ultra-253b
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
nebius/llama-3.1-nemotron-ultra-253b
Context: 128k
$0.60 in/$1.80 out
llama-3.3-70b-instruct
meta-llama/Llama-3.3-70B-Instruct
nebius/llama-3.3-70b-instruct
Context: 128k
$0.13 in/$0.40 out
llama-3.1-405b-instruct
meta-llama/Meta-Llama-3.1-405B-Instruct
nebius/llama-3.1-405b-instruct
Context: 128k
$1.00 in/$3.00 out
deepseek-v3
deepseek-ai/DeepSeek-V3
cloudrift/deepseek-v3
Context: 163.8k
$0.15 in/$0.40 out
deepseek-r1-0528
deepseek-ai/DeepSeek-R1-0528
cloudrift/deepseek-r1-0528
Context: 32.8k
$0.25 in/$1.00 out
qwen3-235b-a22b-instruct-2507
Qwen/Qwen3-235B-A22B-Instruct-2507
nebius/qwen3-235b-a22b-instruct-2507
Context: 262k
$0.20 in/$0.60 out
qwen3-235b-a22b-thinking-2507
Qwen/Qwen3-235B-A22B-Thinking-2507
nebius/qwen3-235b-a22b-thinking-2507
Context: 262k
$0.20 in/$0.60 out
qwen25-coder-7b
Qwen/Qwen2.5-Coder-7B-fast
nebius/qwen25-coder-7b
Context: 32.8k
$0.01 in/$0.03 out
qwen25-32b-instruct
Qwen/Qwen2.5-32B-Instruct
nebius/qwen25-32b-instruct
Context: 32.8k
$0.06 in/$0.20 out
qwen25-72b-instruct
Qwen/Qwen2.5-72B-Instruct
nebius/qwen25-72b-instruct
Context: 32.8k
$0.13 in/$0.40 out
qwen2-vl-72b-instruct
Qwen/Qwen2-VL-72B-Instruct
nebius/qwen2-vl-72b-instruct
Context: 32.8k
$0.13 in/$0.40 out
qwen2-5-vl-72b-instruct
Qwen/Qwen2.5-VL-72B-Instruct
nebius/qwen2-5-vl-72b-instruct
Context: 32.8k
$0.13 in/$0.40 out
qwen3-coder-480b-a35b-instruct
Qwen/Qwen3-Coder-480B-A35B-Instruct
nebius/qwen3-coder-480b-a35b-instruct
Context: 262k
$0.40 in/$1.80 out
qwen3-coder-30b-a3b-instruct
Qwen/Qwen3-Coder-30B-A3B-Instruct
nebius/qwen3-coder-30b-a3b-instruct
Context: 262k
$0.10 in/$0.30 out
qwen3-30b-a3b-instruct-2507
Qwen/Qwen3-30B-A3B-Instruct-2507
nebius/qwen3-30b-a3b-instruct-2507
Context: 262k
$0.10 in/$0.30 out
qwen3-30b-a3b-thinking-2507
Qwen/Qwen3-30B-A3B-Thinking-2507
nebius/qwen3-30b-a3b-thinking-2507
Context: 262k
$0.10 in/$0.30 out
hermes-3-llama-405b
NousResearch/Hermes-3-Llama-405B
nebius/hermes-3-llama-405b
Context: 131.1k
$1.00 in/$3.00 out