Nemotron 3 Ultra 550B

NVIDIA's most capable model with 550B parameters for complex reasoning, coding, and multimodal tasks.

nemotron-3-ultra-550b
Status: Stable
Context: 262,144 tokens
Starting at $0.50/M input tokens
Starting at $2.50/M output tokens
Capabilities: Streaming, Vision, Tools, JSON Output

All Providers for Nemotron 3 Ultra 550B

LLM Gateway routes each request to the best available provider that can handle your prompt size and parameters.
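As a rough illustration of what "able to handle your prompt size and parameters" could mean, the sketch below filters a provider list by context window and required features, then orders the survivors by price. This is not LLM Gateway's actual routing logic, which the page does not describe. The `Provider` type, the `eligible_providers` function, the feature names, and the price-based ordering are all hypothetical; the feature set is taken from the model's listed capabilities and is only assumed to apply to the provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    """Hypothetical record for one provider of a model."""
    name: str
    context_window: int   # maximum tokens (prompt + completion)
    input_per_m: float    # USD per million input tokens
    output_per_m: float   # USD per million output tokens
    features: frozenset   # capability names, assumed labels


# Values taken from this page; the feature labels are illustrative.
PROVIDERS = [
    Provider(
        name="DeepInfra",
        context_window=262_144,
        input_per_m=0.50,
        output_per_m=2.50,
        features=frozenset({"streaming", "vision", "tools", "json_output"}),
    ),
]


def eligible_providers(providers, prompt_tokens, max_output_tokens,
                       required_features=frozenset()):
    """Return providers that fit the request, cheapest first (assumed policy)."""
    needed = prompt_tokens + max_output_tokens
    fits = [
        p for p in providers
        if p.context_window >= needed and required_features <= p.features
    ]
    # Ordering by combined price is an assumption, not documented behavior.
    return sorted(fits, key=lambda p: p.input_per_m + p.output_per_m)


if __name__ == "__main__":
    chosen = eligible_providers(PROVIDERS, 200_000, 4_096, frozenset({"tools"}))
    print([p.name for p in chosen])
```

A request whose prompt plus requested output exceeds 262,144 tokens would leave the list empty under this sketch, since DeepInfra is the only provider listed.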

DeepInfra
Context: 262.1k tokens
Input: $0.50/M tokens
Cached: $0.15/M tokens
Output: $2.50/M tokens
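
To make the per-million-token rates concrete, here is a minimal cost estimate using the DeepInfra rates listed above. It assumes that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate; the page does not spell out how cached tokens are counted, so treat that as an assumption. The function name `estimate_cost` is illustrative.

```python
# USD per million tokens, copied from the DeepInfra listing.
INPUT_PER_M = 0.50
CACHED_PER_M = 0.15
OUTPUT_PER_M = 2.50


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from
    cache and billed at the cached rate instead of the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_tokens
    total = (
        uncached * INPUT_PER_M
        + cached_tokens * CACHED_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100k-token prompt, 80k of it cached, 2k-token reply.
    print(f"${estimate_cost(100_000, 2_000, cached_tokens=80_000):.4f}")
```

Under these assumptions, a request with 1M uncached input tokens and 1M output tokens costs $0.50 + $2.50 = $3.00.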