Groq Provider

Fast inference on Groq's LPU hardware for the models listed below.


Data & Privacy

HQ: US
API Training: No
Consumer Training: No
Prompt Logging: No
Retention: 0 days
GDPR: Compliant
SOC2: Certified


Available Models

GPT OSS 120B

openai
gpt-oss-120b
Streaming, Tools, Reasoning, JSON Output
Groq
Context: 131.1k
Input: $0.15 /M tokens
Cached: — /M tokens
Output: $0.75 /M tokens

GPT OSS 20B

openai
gpt-oss-20b
Streaming, Tools, Reasoning, JSON Output
Groq
Context: 131.1k
Input: $0.10 /M tokens
Cached: — /M tokens
Output: $0.50 /M tokens
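Because prices are quoted per million tokens, the cost of a single request is easy to estimate. The sketch below is illustrative only: the `PRICES_PER_MILLION` table and the `estimate_cost` helper are hypothetical names, and the figures are copied from the two active models listed above. It ignores cached-token pricing, which the listing does not give for these models.

```python
# Prices in USD per million tokens, copied from the listing above.
# The dictionary layout and helper name are assumptions for illustration.
PRICES_PER_MILLION = {
    "gpt-oss-120b": {"input": 0.15, "output": 0.75},
    "gpt-oss-20b": {"input": 0.10, "output": 0.50},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # listed prices are per million tokens


if __name__ == "__main__":
    # Example request: 10,000 prompt tokens and 2,000 completion tokens.
    for model in PRICES_PER_MILLION:
        cost = estimate_cost(model, input_tokens=10_000, output_tokens=2_000)
        print(f"{model}: ${cost:.4f}")
```

For that example request, the estimate comes to roughly $0.003 on gpt-oss-120b and $0.002 on gpt-oss-20b. Actual billing may differ, so treat this as a rough planning aid.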

Kimi K2

moonshot
kimi-k2
Streaming, Tools, JSON Output
Groq
Context: 131.1k
Model deactivated since Jun 5, 2026

Input

/M tokens

Cached

$0.5

/M tokens

Output

/M tokens


Llama Guard 4 12B

meta
llama-guard-4-12b
Streaming
Groq
Context: 131.1k
Model deactivated since Mar 29, 2026
Input: $0.20 /M tokens
Cached: — /M tokens
Output: $0.20 /M tokens

DeepSeek R1 Distill Llama 70B

deepseek
deepseek-r1-distill-llama-70b
Streaming, Tools, JSON Output
Groq
Context: 131.1k
Model deactivated since Oct 9, 2025
Input: $0.75 /M tokens
Cached: — /M tokens
Output: $0.99 /M tokens

Gemma2 9B IT

google
gemma2-9b-it
Streaming, Tools
Groq
Context: 8.1k
Model deactivated since Oct 8, 2025
Input: $0.20 /M tokens
Cached: — /M tokens
Output: $0.20 /M tokens
