Groq
InferenceOptimization • US
Ultra-fast inference
💰 Paid Models (8)
| Model | Input/1M | Output/1M | Context | Capabilities | |
|---|---|---|---|---|---|
| Llama 3 8B llama-3-8b | $0.050 | $0.080 | 8K | chat | Compare |
| Llama 3.1 8B llama-3-1-8b | $0.050 | $0.080 | 128K | chatfunction_calling | Compare |
| Gemma 2 9B gemma-2-9b | $0.200 | $0.200 | 8K | chat | Compare |
| Mixtral 8x7B mixtral-8x7b | $0.240 | $0.240 | 33K | chat | Compare |
| Llama 3 70B llama-3-70b | $0.590 | $0.790 | 8K | chat | Compare |
| Llama 3.1 70B llama-3-1-70b | $0.590 | $0.790 | 128K | chatfunction_calling | Compare |
| Llama 3.3 70B llama-3-3-70b | $0.590 | $0.790 | 128K | chatfunction_calling | Compare |
| DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b | $0.750 | $0.990 | 128K | chatreasoning | Compare |