Qwen3 4B FP8

Lightweight Qwen 3 4B with FP8 quantization.

qwen3-4b-fp8
STABLE
128,000 context
Starting at $0.03/M input tokens
Starting at $0.03/M output tokens
Streaming

Select Provider

All Providers for Qwen3 4B FP8

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

NovitaAI

novita/qwen3-4b-fp8
Context Size
128k
Stability
STABLE
Pricing
Input
$0.03
/M
Cached
Output
$0.03
/M
Capabilities
Streaming
Try in Playground