Groq Provider

Groq's ultra-fast LPU inference with various models

Available Models

gemma2-9b-it
Groq
JSON OutputStreaming8.1k context

Context: 8.1k

$0.20 in/$0.20 out

llama-guard-4-12b
Groq
JSON OutputStreaming131.1k context

Context: 131.1k

$0.20 in/$0.20 out

deepseek-r1-distill-llama-70b
Groq
JSON OutputStreaming131.1k context

Context: 131.1k

$0.75 in/$0.99 out