Qwen3 4B FP8

Lightweight Qwen 3 4B with FP8 quantization.

qwen3-4b-fp8
STABLE
128,000 context
Starting at $0.03/M input tokens
Starting at $0.03/M output tokens
Streaming

Select Provider

All Providers for Qwen3 4B FP8

LangRouter routes requests to the best providers that are able to handle your prompt size and parameters.

NovitaAI

novita/qwen3-4b-fp8
Context Size
128k
Stability
STABLE
Pricing
Input
$0.03
/M
Cached
Output
$0.03
/M
Capabilities
Streaming
Try in Playground