← Back to Providers
Groq logo

Groq

InferenceOptimizationUS
Total Models17
Free Models0
Paid Models17

Ultra-fast inference

💰 Paid Models (17)

ModelInput/1MOutput/1MContextCapabilities
Llama 3.1 8B
llama-3-1-8b
$0.050$0.080128Kchatfunction_calling
Llama 3 8B
llama-3-8b
$0.050$0.0808Kchat
Llama 3.1 8B Instant
llama-3-1-8b-instant
$0.050$0.080128Kchatfunction_calling
GPT-OSS 20B
gpt-oss-20b
$0.075$0.300128Kchat
Llama 4 Scout
llama-4-scout
$0.110$0.340128Kchatfunction_calling
GPT-OSS 120B
gpt-oss-120b
$0.150$0.600128Kchat
Llama 4 Maverick
llama-4-maverick
$0.200$0.600128Kchatfunction_calling
Gemma 2 9B
gemma-2-9b
$0.200$0.2008Kchat
Llama Guard 4 12B
llama-guard-4-12b
$0.200$0.200128Kchatmoderation
Mixtral 8x7B
mixtral-8x7b
$0.240$0.24033Kchat
Qwen3 32B
qwen3-32b
$0.290$0.590131Kchatfunction_calling
Llama 3 70B
llama-3-70b
$0.590$0.7908Kchat
Llama 3.3 70B
llama-3-3-70b
$0.590$0.790128Kchatfunction_calling
Llama 3.1 70B
llama-3-1-70b
$0.590$0.790128Kchatfunction_calling
Llama 3.3 70B Versatile
llama-3-3-70b-versatile
$0.590$0.790128Kchatfunction_calling
DeepSeek R1 Distill Llama 70B
deepseek-r1-distill-llama-70b
$0.750$0.990128Kchatreasoning
Kimi K2
kimi-k2
$1.00$3.00256Kchatreasoning