Qwen3 4B FP8

Lightweight Qwen 3 4B with FP8 quantization.

qwen3-4b-fp8
STABLEGet Started
128,000 context
Starting at $0.03/M input tokens
Starting at $0.03/M output tokens
Streaming

Select Provider

All Providers for Qwen3 4B FP8

LLM Gateway routes requests to the best providers that are able to handle your prompt size and parameters.

NovitaAI
Context: 128k
Input
$0.03
/M tokens
Cached
/M tokens
Output
$0.03
/M tokens
Get Started