← Back to Providers
Deepinfra logo

DeepInfra

AggregationUS
Total Models86
Free Models0
Paid Models86

Inference service

💰 Paid Models (86)

ModelInput/1MOutput/1MContextCapabilities
Llama-3.2-3B-Instruct
meta-llama-llama-32-3b-instruct
$0.020$0.020131Kchat
Meta-Llama-3.1-8B-Instruct-Turbo
meta-llama-meta-llama-31-8b-instruct-turbo
$0.020$0.030131Kchat
Mistral-Nemo-Instruct-2407
mistralai-mistral-nemo-instruct-2407
$0.020$0.040131Kchat
Meta-Llama-3-8B-Instruct
meta-llama-meta-llama-3-8b-instruct
$0.030$0.0608Kchat
Meta-Llama-3.1-8B-Instruct
meta-llama-meta-llama-31-8b-instruct
$0.030$0.050131Kchat
gpt-oss-20b
openai-gpt-oss-20b
$0.030$0.140131Kchatreasoning
DeepSeek-OCR
deepseek-ai-deepseek-ocr
$0.030$0.1008Kchat
gemini-1.5-flash-8b
google-gemini-15-flash-8b
$0.037$0.1501.0Mchatreasoning
gpt-oss-120b
openai-gpt-oss-120b
$0.039$0.190131Kchatreasoning
L3-8B-Lunaris-v1-Turbo
sao10k-l3-8b-lunaris-v1-turbo
$0.040$0.0508Kchat
gemma-3-4b-it
google-gemma-3-4b-it
$0.040$0.080131Kchat
NVIDIA-Nemotron-Nano-9B-v2
nvidia-nvidia-nemotron-nano-9b-v2
$0.040$0.160131Kchat
gemma-3-12b-it
google-gemma-3-12b-it
$0.040$0.130131Kchat
Llama-3.2-11B-Vision-Instruct
meta-llama-llama-32-11b-vision-instruct
$0.049$0.049131Kchatvision
Nemotron-3-Nano-30B-A3B
nvidia-nemotron-3-nano-30b-a3b
$0.050$0.200262Kchat
Mistral-Small-24B-Instruct-2501
mistralai-mistral-small-24b-instruct-2501
$0.050$0.08033Kchat
GLM-4.7-Flash
zai-org-glm-47-flash
$0.060$0.400203Kchat
phi-4
microsoft-phi-4
$0.070$0.14016Kchat
Qwen3-235B-A22B-Instruct-2507
qwen-qwen3-235b-a22b-instruct-2507
$0.071$0.463262Kchatreasoning
gemini-1.5-flash
google-gemini-15-flash
$0.075$0.3001.0Mchatreasoning
Mistral-Small-3.2-24B-Instruct-2506
mistralai-mistral-small-32-24b-instruct-2506
$0.075$0.200128Kchat
Qwen3-30B-A3B
qwen-qwen3-30b-a3b
$0.080$0.29041Kchatreasoning
Llama-4-Scout-17B-16E-Instruct
meta-llama-llama-4-scout-17b-16e-instruct
$0.080$0.300328Kchat
MythoMax-L2-13b
gryphe-mythomax-l2-13b
$0.080$0.0804Kchat
Qwen3-32B
qwen-qwen3-32b
$0.080$0.28041Kchatreasoning
Qwen3-14B
qwen-qwen3-14b
$0.080$0.24041Kchatreasoning
Qwen3-Next-80B-A3B-Instruct
qwen-qwen3-next-80b-a3b-instruct
$0.090$1.10262Kchatreasoning
olmOCR-2-7B-1025
allenai-olmocr-2-7b-1025
$0.090$0.19016Kchat
gemma-3-27b-it
google-gemma-3-27b-it
$0.090$0.160131Kchat
Llama-3.3-70B-Instruct-Turbo
meta-llama-llama-33-70b-instruct-turbo
$0.100$0.320131Kchat
Llama-3.3-Nemotron-Super-49B-v1.5
nvidia-llama-33-nemotron-super-49b-v15
$0.100$0.400131Kchat
Qwen2.5-72B-Instruct
qwen-qwen25-72b-instruct
$0.120$0.39033Kchat
PaddleOCR-VL-0.9B
paddlepaddle-paddleocr-vl-09b
$0.140$0.80016Kchatvision
Llama-4-Maverick-17B-128E-Instruct-FP8
meta-llama-llama-4-maverick-17b-128e-instruct-fp8
$0.150$0.6001.0Mchat
gpt-oss-120b-Turbo
openai-gpt-oss-120b-turbo
$0.150$0.600131Kchatreasoning
Qwen3-VL-30B-A3B-Instruct
qwen-qwen3-vl-30b-a3b-instruct
$0.150$0.600262Kchatvisionreasoning
Llama-Guard-4-12B
meta-llama-llama-guard-4-12b
$0.180$0.180164Kchat
Qwen2.5-VL-32B-Instruct
qwen-qwen25-vl-32b-instruct
$0.200$0.600128Kchatvision
NVIDIA-Nemotron-Nano-12B-v2-VL
nvidia-nvidia-nemotron-nano-12b-v2-vl
$0.200$0.600131Kchatvision
DeepSeek-V3-0324
deepseek-ai-deepseek-v3-0324
$0.200$0.880164Kchat
Qwen3-VL-235B-A22B-Instruct
qwen-qwen3-vl-235b-a22b-instruct
$0.200$1.20262Kchatvisionreasoning
Olmo-3.1-32B-Instruct
allenai-olmo-31-32b-instruct
$0.200$0.60066Kchat
DeepSeek-V3.1-Terminus
deepseek-ai-deepseek-v31-terminus
$0.210$0.790164Kchatreasoning
DeepSeek-V3.1
deepseek-ai-deepseek-v31
$0.210$0.790164Kchatreasoning
Qwen3-235B-A22B-Thinking-2507
qwen-qwen3-235b-a22b-thinking-2507
$0.230$2.39262Kchatreasoning
DeepSeek-V3.2
deepseek-ai-deepseek-v32
$0.260$0.380164Kchat
Qwen3-Coder-480B-A35B-Instruct-Turbo
qwen-qwen3-coder-480b-a35b-instruct-turbo
$0.280$1.20262Kchatcode
MiniMax-M2.1
minimaxai-minimax-m21
$0.280$1.20197Kchat
Hermes-3-Llama-3.1-70B
nousresearch-hermes-3-llama-31-70b
$0.300$0.300131Kchat
gemini-2.5-flash
google-gemini-25-flash
$0.300$2.501.0Mchatreasoning
GLM-4.6V
zai-org-glm-46v
$0.300$0.900131Kchat
DeepSeek-V3
deepseek-ai-deepseek-v3
$0.320$0.890164Kchat
Kimi-K2-Instruct-0905
moonshotai-kimi-k2-instruct-0905
$0.400$2.00131Kchat
Meta-Llama-3.1-70B-Instruct
meta-llama-meta-llama-31-70b-instruct
$0.400$0.400131Kchat
GLM-4.7
zai-org-glm-47
$0.400$1.90203Kchat
Qwen3-Coder-480B-A35B-Instruct
qwen-qwen3-coder-480b-a35b-instruct
$0.400$1.60262Kchatcode
Meta-Llama-3.1-70B-Instruct-Turbo
meta-llama-meta-llama-31-70b-instruct-turbo
$0.400$0.400131Kchat
GLM-4.6
zai-org-glm-46
$0.430$1.75203Kchat
Kimi-K2.5
moonshotai-kimi-k25
$0.450$2.80262Kchat
Kimi-K2-Thinking
moonshotai-kimi-k2-thinking
$0.470$2.00131Kchat
WizardLM-2-8x22B
microsoft-wizardlm-2-8x22b
$0.480$0.48066Kchat
DeepSeek-R1-0528
deepseek-ai-deepseek-r1-0528
$0.500$2.15164Kchat
Mixtral-8x7B-Instruct-v0.1
mistralai-mixtral-8x7b-instruct-v01
$0.540$0.54033Kchat
DeepSeek-R1-Distill-Llama-70B
deepseek-ai-deepseek-r1-distill-llama-70b
$0.600$1.20131Kchat
L3.3-70B-Euryale-v2.3
sao10k-l33-70b-euryale-v23
$0.850$0.850131Kchat
L3.1-70B-Euryale-v2.2
sao10k-l31-70b-euryale-v22
$0.850$0.850131Kchat
Hermes-3-Llama-3.1-405B
nousresearch-hermes-3-llama-31-405b
$1.00$1.00131Kchat
DeepSeek-R1-0528-Turbo
deepseek-ai-deepseek-r1-0528-turbo
$1.00$3.0033Kchat
Llama-3.1-Nemotron-70B-Instruct
nvidia-llama-31-nemotron-70b-instruct
$1.20$1.20131Kchat
gemini-2.5-pro
google-gemini-25-pro
$1.25$10.001.0Mchatreasoning
claude-4-sonnet
anthropic-claude-4-sonnet
$3.30$16.50200Kchatreasoning
claude-3-7-sonnet-latest
anthropic-claude-3-7-sonnet-latest
$3.30$16.50200Kchatreasoning
claude-4-opus
anthropic-claude-4-opus
$16.50$82.50200Kchatreasoning
e5-base-v2
e5-base-v2
$512.00$0.0050-chat
all-MiniLM-L12-v2
all-minilm-l12-v2
$512.00$0.0050-chat
multi-qa-mpnet-base-dot-v1
multi-qa-mpnet-base-dot-v1
$512.00$0.0050-chat
bge-large-en-v1.5
bge-large-en-v1-5
$512.00$0.010-chat
bge-base-en-v1.5
bge-base-en-v1-5
$512.00$0.0050-chat
all-MiniLM-L6-v2
all-minilm-l6-v2
$512.00$0.0050-chat
gte-base
gte-base
$512.00$0.0050-chat
text2vec-base-chinese
text2vec-base-chinese
$512.00$0.0050-chat
all-mpnet-base-v2
all-mpnet-base-v2
$512.00$0.0050-chat
gte-large
gte-large
$512.00$0.010-chat
paraphrase-MiniLM-L6-v2
paraphrase-minilm-l6-v2
$512.00$0.0050-chat
multilingual-e5-large
multilingual-e5-large
$512.00$0.010-chat
e5-large-v2
e5-large-v2
$512.00$0.010-chat