🆓 Free Models (2)
| Model | Context | Capabilities |
|---|---|---|
| Qwen3 Omni 30B A3B Thinking qwen-qwen3-omni-30b-a3b-thinking | 66K | chatfunction_callingstructured_output |
| Qwen3 Omni 30B A3B Instruct qwen-qwen3-omni-30b-a3b-instruct | 66K | chatfunction_callingstructured_output |
💰 Paid Models (81)
| Model | Input/1M | Output/1M | Context | Capabilities |
|---|---|---|---|---|
| PaddleOCR-VL paddlepaddle-paddleocr-vl | $0.200 | $0.200 | 16K | chatvision |
| Llama 3.1 8B Instruct meta-llama-llama-3.1-8b-instruct | $0.200 | $0.500 | 16K | chat |
| Qwen3 4B qwen-qwen3-4b-fp8 | $0.300 | $0.300 | 128K | chatfunction_callingreasoning |
| Llama 3.2 3B Instruct meta-llama-llama-3.2-3b-instruct | $0.300 | $0.500 | 33K | chat |
| DeepSeek-OCR 2 deepseek-deepseek-ocr-2 | $0.300 | $0.300 | 8K | chatvision |
| DeepSeek-OCR deepseek-deepseek-ocr | $0.300 | $0.300 | 8K | chatvision |
| Qwen3 8B qwen-qwen3-8b-fp8 | $0.350 | $1.38 | 128K | chat |
| AutoGLM-Phone-9B-Multilingual zai-org-autoglm-phone-9b-multilingual | $0.350 | $1.38 | 66K | chatvision |
| OpenAI: GPT OSS 20B openai-gpt-oss-20b | $0.400 | $1.50 | 131K | chatstructured_outputreasoning |
| Mistral Nemo mistralai-mistral-nemo | $0.400 | $1.70 | 60K | chatstructured_output |
| Llama 3 8B Instruct meta-llama-llama-3-8b-instruct | $0.400 | $0.400 | 8K | chat |
| L3 8B Stheno V3.2 sao10k-l3-8b-stheno-v3.2 | $0.500 | $0.500 | 8K | chatfunction_calling |
| Sao10k L3 8B Lunaris sao10k-l3-8b-lunaris | $0.500 | $0.500 | 8K | chatstructured_output |
| OpenAI GPT OSS 120B openai-gpt-oss-120b | $0.500 | $2.50 | 131K | chatfunction_callingstructured_output |
| Gemma3 12B google-gemma-3-12b-it | $0.500 | $1.00 | 131K | chatvision |
| DeepSeek R1 0528 Qwen3 8B deepseek-deepseek-r1-0528-qwen3-8b | $0.600 | $0.900 | 128K | chat |
| Qwen2.5 7B Instruct qwen-qwen2.5-7b-instruct | $0.700 | $0.700 | 32K | chatfunction_callingstructured_output |
| BaiChuan M2 32B baichuan-baichuan-m2-32b | $0.700 | $0.700 | 131K | chat |
| ERNIE 4.5 21B A3B baidu-ernie-4.5-21b-a3b | $0.700 | $2.80 | 120K | chatfunction_calling |
| GLM-4.7-Flash zai-org-glm-4.7-flash | $0.700 | $4.00 | 200K | chatfunction_callingstructured_output |
| ERNIE-4.5-21B-A3B-Thinking baidu-ernie-4.5-21b-a3b-thinking | $0.700 | $2.80 | 131K | chatreasoning |
| Qwen3 Coder 30b A3B Instruct qwen-qwen3-coder-30b-a3b-instruct | $0.700 | $2.70 | 160K | chatfunction_callingstructured_output |
| qwen/qwen3-vl-8b-instruct qwen-qwen3-vl-8b-instruct | $0.800 | $5.00 | 131K | chatfunction_callingstructured_output |
| Qwen3 235B A22B Instruct 2507 qwen-qwen3-235b-a22b-instruct-2507 | $0.900 | $5.80 | 131K | chatfunction_callingstructured_output |
| Qwen3 30B A3B qwen-qwen3-30b-a3b-fp8 | $0.900 | $4.50 | 41K | chatfunction_callingreasoning |
| Mythomax L2 13B gryphe-mythomax-l2-13b | $0.900 | $0.900 | 4K | chat |
| Qwen3 32B qwen-qwen3-32b-fp8 | $1.00 | $4.50 | 41K | chatreasoning |
| XiaomiMiMo/MiMo-V2-Flash xiaomimimo-mimo-v2-flash | $1.00 | $3.00 | 262K | chatfunction_callingstructured_output |
| Gemma 3 27B google-gemma-3-27b-it | $1.19 | $2.00 | 98K | chatvision |
| zai-org/glm-4.5-air zai-org-glm-4.5-air | $1.30 | $8.50 | 131K | chatfunction_callingreasoning |
| Llama 3.3 70B Instruct meta-llama-llama-3.3-70b-instruct | $1.35 | $4.00 | 131K | chatfunction_calling |
| Hermes 2 Pro Llama 3 8B nousresearch-hermes-2-pro-llama-3-8b | $1.40 | $1.40 | 8K | chatstructured_output |
| ERNIE 4.5 VL 28B A3B baidu-ernie-4.5-vl-28b-a3b | $1.40 | $5.60 | 30K | chatfunction_callingreasoning |
| Qwen3 Next 80B A3B Instruct qwen-qwen3-next-80b-a3b-instruct | $1.50 | $15.00 | 131K | chatfunction_callingstructured_output |
| Qwen3 Next 80B A3B Thinking qwen-qwen3-next-80b-a3b-thinking | $1.50 | $15.00 | 131K | chatfunction_callingstructured_output |
| DeepSeek R1 Distill Qwen 14B deepseek-deepseek-r1-distill-qwen-14b | $1.50 | $1.50 | 33K | chat |
| Llama 4 Scout Instruct meta-llama-llama-4-scout-17b-16e-instruct | $1.80 | $5.90 | 131K | chatvision |
| qwen/qwen3-vl-30b-a3b-thinking qwen-qwen3-vl-30b-a3b-thinking | $2.00 | $10.00 | 131K | chatfunction_callingstructured_output |
| Qwen3 235B A22B qwen-qwen3-235b-a22b-fp8 | $2.00 | $8.00 | 41K | chatreasoning |
| qwen/qwen3-vl-30b-a3b-instruct qwen-qwen3-vl-30b-a3b-instruct | $2.00 | $7.00 | 131K | chatfunction_callingstructured_output |
| Qwen MT Plus qwen-qwen-mt-plus | $2.50 | $7.50 | 16K | chat |
| Deepseek V3.2 deepseek-deepseek-v3.2 | $2.69 | $4.00 | 164K | chatfunction_callingstructured_output |
| DeepSeek V3 0324 deepseek-deepseek-v3-0324 | $2.70 | $11.20 | 164K | chatfunction_callingstructured_output |
| Deepseek V3.1 Terminus deepseek-deepseek-v3.1-terminus | $2.70 | $10.00 | 131K | chatfunction_callingstructured_output |
| Deepseek V3.2 Exp deepseek-deepseek-v3.2-exp | $2.70 | $4.10 | 164K | chatfunction_callingstructured_output |
| Llama 4 Maverick Instruct meta-llama-llama-4-maverick-17b-128e-instruct-fp8 | $2.70 | $8.50 | 1.0M | chatvision |
| DeepSeek V3.1 deepseek-deepseek-v3.1 | $2.70 | $10.00 | 131K | chatfunction_callingstructured_output |
| ERNIE 4.5 300B A47B baidu-ernie-4.5-300b-a47b-paddle | $2.80 | $11.00 | 123K | chatstructured_output |
| DeepSeek R1 Distill Qwen 32B deepseek-deepseek-r1-distill-qwen-32b | $3.00 | $3.00 | 64K | chat |
| GLM 4.6V zai-org-glm-4.6v | $3.00 | $9.00 | 131K | chatfunction_callingstructured_output |
| Qwen3 Coder 480B A35B Instruct qwen-qwen3-coder-480b-a35b-instruct | $3.00 | $13.00 | 262K | chatfunction_callingstructured_output |
| Minimax M2.1 minimax-minimax-m2.1 | $3.00 | $12.00 | 205K | chatfunction_callingstructured_output |
| Qwen3 235B A22b Thinking 2507 qwen-qwen3-235b-a22b-thinking-2507 | $3.00 | $30.00 | 131K | chatfunction_callingreasoning |
| Kat Coder Pro kwaipilot-kat-coder-pro | $3.00 | $12.00 | 256K | chatfunction_callingstructured_output |
| Qwen3 VL 235B A22B Instruct qwen-qwen3-vl-235b-a22b-instruct | $3.00 | $15.00 | 131K | chatfunction_callingstructured_output |
| MiniMax-M2 minimax-minimax-m2 | $3.00 | $12.00 | 205K | chatfunction_callingreasoning |
| Qwen 2.5 72B Instruct qwen-qwen-2.5-72b-instruct | $3.80 | $4.00 | 32K | chatfunction_callingstructured_output |
| ERNIE-4.5-VL-28B-A3B-Thinking baidu-ernie-4.5-vl-28b-a3b-thinking | $3.90 | $3.90 | 131K | chatfunction_callingstructured_output |
| DeepSeek V3 (Turbo) deepseek-deepseek-v3-turbo | $4.00 | $13.00 | 64K | chatfunction_calling |
| ERNIE 4.5 VL 424B A47B baidu-ernie-4.5-vl-424b-a47b | $4.20 | $12.50 | 123K | chatreasoningvision |
| Llama3 70B Instruct meta-llama-llama-3-70b-instruct | $5.10 | $7.40 | 8K | chatstructured_output |
| GLM 4.6 zai-org-glm-4.6 | $5.50 | $22.00 | 205K | chatfunction_callingstructured_output |
| MiniMax M1 minimaxai-minimax-m1-80k | $5.50 | $22.00 | 1.0M | chatfunction_callingreasoning |
| Kimi K2 Instruct moonshotai-kimi-k2-instruct | $5.70 | $23.00 | 131K | chatfunction_callingstructured_output |
| Kimi K2.5 moonshotai-kimi-k2.5 | $6.00 | $30.00 | 262K | chatreasoningstructured_output |
| GLM-4.7 zai-org-glm-4.7 | $6.00 | $22.00 | 205K | chatfunction_callingstructured_output |
| GLM 4.5V zai-org-glm-4.5v | $6.00 | $18.00 | 66K | chatfunction_callingstructured_output |
| Kimi K2 Thinking moonshotai-kimi-k2-thinking | $6.00 | $25.00 | 262K | chatfunction_callingstructured_output |
| Kimi K2 0905 moonshotai-kimi-k2-0905 | $6.00 | $25.00 | 262K | chatfunction_callingstructured_output |
| GLM-4.5 zai-org-glm-4.5 | $6.00 | $22.00 | 131K | chatfunction_callingreasoning |
| Wizardlm 2 8x22B microsoft-wizardlm-2-8x22b | $6.20 | $6.20 | 66K | chat |
| DeepSeek R1 (Turbo) deepseek-deepseek-r1-turbo | $7.00 | $25.00 | 64K | chatfunction_callingreasoning |
| DeepSeek R1 0528 deepseek-deepseek-r1-0528 | $7.00 | $25.00 | 164K | chatfunction_callingstructured_output |
| Deepseek Prover V2 671B deepseek-deepseek-prover-v2-671b | $7.00 | $25.00 | 160K | chat |
| DeepSeek R1 Distill LLama 70B deepseek-deepseek-r1-distill-llama-70b | $8.00 | $8.00 | 8K | chatstructured_outputreasoning |
| Qwen2.5 VL 72B Instruct qwen-qwen2.5-vl-72b-instruct | $8.00 | $8.00 | 33K | chatvision |
| Qwen3 VL 235B A22B Thinking qwen-qwen3-vl-235b-a22b-thinking | $9.80 | $39.50 | 131K | chatreasoningvision |
| L3 70B Euryale V2.1 sao10k-l3-70b-euryale-v2.1 | $14.80 | $14.80 | 8K | chatfunction_calling |
| L31 70B Euryale V2.2 sao10k-l31-70b-euryale-v2.2 | $14.80 | $14.80 | 8K | chatfunction_calling |
| Qwen3 Max qwen-qwen3-max | $21.10 | $84.50 | 262K | chatfunction_callingstructured_output |
| Text to Video text-to-video | $32.00 | $20.00 | - | chat |