Cerebras Provider

Cerebras offers high-performance inference with very high throughput.

Available Models

GPT OSS 120B

openai
gpt-oss-120b

Providers

Cerebras
cerebras/gpt-oss-120b
Context Size: 131.1k
Stability: STABLE
Pricing: Input $0.35/M, Cached (not listed), Output $0.75/M
Capabilities: Streaming, Tools, Reasoning, JSON Output

Llama 3.1 8B Instruct

meta
llama-3.1-8b-instruct

Providers

Cerebras
cerebras/llama-3.1-8b-instruct
Context Size: 128k
Stability: STABLE
Pricing: Input $0.10/M, Cached (not listed), Output $0.10/M
Capabilities: Streaming, JSON Output

Llama 3.3 70B Instruct

meta
llama-3.3-70b-instruct

Providers

Cerebras
cerebras/llama-3.3-70b-instruct
Context Size: 128k
Stability: STABLE
Pricing: Input $0.85/M, Cached (not listed), Output $1.20/M
Capabilities: Streaming, Tools, JSON Output

Qwen3 235B A22B Instruct 2507

alibaba
qwen3-235b-a22b-instruct-2507

Providers

Cerebras
cerebras/qwen3-235b-a22b-instruct-2507
Context Size: 262k
Stability: STABLE
Pricing: Input $0.60/M, Cached (not listed), Output $1.20/M
Capabilities: Streaming, Tools, JSON Output

Qwen3 32B

alibaba
qwen3-32b

Providers

Cerebras
cerebras/qwen3-32b
Context Size: 32.8k
Stability: STABLE
Pricing: Input $0.40/M, Cached (not listed), Output $0.80/M
Capabilities: Streaming, Tools, JSON Output

GLM-4.6

glm
glm-4.6

Providers

Cerebras
cerebras/glm-4.6
Context Size: 200k
Stability: UNSTABLE
Pricing: Input $2.25/M, Cached (not listed), Output $2.75/M
Capabilities: Streaming, Tools, Reasoning, JSON Output
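The per-million-token prices listed for each model can be turned into a rough per-request cost estimate. The following is a minimal sketch, not part of any official SDK: the table name `PRICES_PER_MILLION` and the function `estimate_cost` are hypothetical, the prices are copied from the listing, and cached-input pricing is left out because no cached price is shown for any model.

```python
from decimal import Decimal

# USD per one million tokens, as (input, output), copied from the listing.
# Cached-input prices are omitted because the listing does not show them.
PRICES_PER_MILLION = {
    "cerebras/gpt-oss-120b": (Decimal("0.35"), Decimal("0.75")),
    "cerebras/llama-3.1-8b-instruct": (Decimal("0.10"), Decimal("0.10")),
    "cerebras/llama-3.3-70b-instruct": (Decimal("0.85"), Decimal("1.20")),
    "cerebras/qwen3-235b-a22b-instruct-2507": (Decimal("0.60"), Decimal("1.20")),
    "cerebras/qwen3-32b": (Decimal("0.40"), Decimal("0.80")),
    "cerebras/glm-4.6": (Decimal("2.25"), Decimal("2.75")),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[model_id]
    total = input_price * input_tokens + output_price * output_tokens
    return total / ONE_MILLION


if __name__ == "__main__":
    # Example: 200,000 input tokens and 50,000 output tokens on GPT OSS 120B.
    print(estimate_cost("cerebras/gpt-oss-120b", 200_000, 50_000))
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding error. Actual billing may differ from this estimate, for example if cached input tokens are priced separately.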