Inference Providers
·
Metrics for top trending models
zai-org/GLM-4.6 | novita | live | 0.6 | 2.2 | 204800 | 0.61 | 43 | Yes | No |
zai-org/GLM-4.6 | zai-org | live | - | - | - | 1.47 | 65 | Yes | No |
deepseek-ai/DeepSeek-V3.2-Exp | novita | live | 0.27 | 0.41 | 163840 | 1.25 | 28 | Yes | Yes |
openai/gpt-oss-20b | fireworks-ai | live | 0.05 | 0.2 | 131072 | 0.70 | 205 | Yes | No |
openai/gpt-oss-20b | novita | live | 0.04 | 0.15 | 131072 | 1.23 | 232 | No | Yes |
openai/gpt-oss-20b | nebius | live | 0.05 | 0.2 | 131072 | 0.34 | 157 | Yes | No |
openai/gpt-oss-20b | nscale | live | 0.05 | 0.2 | 131072 | 0.85 | 71 | Yes | Yes |
openai/gpt-oss-20b | groq | live | 0.1 | 0.5 | 131072 | 0.61 | 614 | Yes | No |
openai/gpt-oss-20b | hyperbolic | live | 0.1 | 0.1 | 131072 | 0.47 | 209 | No | No |
openai/gpt-oss-20b | together | live | 0.05 | 0.2 | 131072 | 0.27 | 177 | Yes | No |
meta-llama/Llama-3.1-8B-Instruct | fireworks-ai | live | 0.2 | 0.2 | 131072 | 0.73 | 236 | No | No |
meta-llama/Llama-3.1-8B-Instruct | cerebras | live | 0.1 | 0.1 | - | 0.29 | 1297 | No | No |
meta-llama/Llama-3.1-8B-Instruct | novita | live | 0.02 | 0.05 | 16384 | 0.65 | 71 | No | No |
meta-llama/Llama-3.1-8B-Instruct | nebius | live | 0.03 | 0.09 | 131072 | 0.43 | 151 | No | No |
meta-llama/Llama-3.1-8B-Instruct | nscale | live | 0.06 | 0.06 | 131072 | 0.66 | 61 | No | Yes |
meta-llama/Llama-3.1-8B-Instruct | sambanova | live | 0.1 | 0.2 | 16384 | 0.45 | 538 | Yes | Yes |
meta-llama/Llama-3.1-8B-Instruct | scaleway | live | - | - | - | 0.40 | 94 | Yes | Yes |
openai/gpt-oss-120b | fireworks-ai | live | 0.15 | 0.6 | 131072 | 0.81 | 138 | Yes | No |
openai/gpt-oss-120b | cerebras | live | 0.25 | 0.69 | - | 0.25 | 764 | Yes | No |
openai/gpt-oss-120b | novita | live | 0.1 | 0.5 | 131072 | 0.52 | 113 | Yes | Yes |
openai/gpt-oss-120b | nebius | live | 0.15 | 0.6 | 131072 | 0.62 | 152 | Yes | Yes |
openai/gpt-oss-120b | nscale | live | 0.1 | 0.4 | 131072 | 0.94 | 119 | Yes | Yes |
openai/gpt-oss-120b | groq | live | 0.15 | 0.75 | 131072 | 0.27 | 403 | Yes | No |
openai/gpt-oss-120b | hyperbolic | live | 0.3 | 0.3 | 131072 | 0.69 | 273 | Yes | No |
openai/gpt-oss-120b | together | live | 0.15 | 0.6 | 131072 | 0.36 | 106 | Yes | Yes |
openai/gpt-oss-120b | sambanova | live | 0.22 | 0.59 | 131072 | 1.54 | 363 | Yes | Yes |
openai/gpt-oss-120b | scaleway | live | - | - | - | 0.34 | 155 | Yes | Yes |
Qwen/Qwen3-8B | nscale | live | 0.07 | 0.18 | 40960 | 0.62 | 53 | Yes | No |
Qwen/Qwen3-4B-Instruct-2507 | nscale | live | 0.01 | 0.03 | 262144 | 0.48 | 58 | Yes | No |
Qwen/Qwen3-Coder-30B-A3B-Instruct | fireworks-ai | offline | 0.15 | 0.6 | 262144 | 0.90 | 102 | Yes | No |
Qwen/Qwen3-Coder-30B-A3B-Instruct | nebius | live | 0.1 | 0.3 | 262144 | 0.43 | 128 | Yes | Yes |
Qwen/Qwen3-Coder-30B-A3B-Instruct | scaleway | live | - | - | - | 0.55 | 75 | Yes | No |
deepseek-ai/DeepSeek-R1 | fireworks-ai | offline | 3 | 8 | 163840 | 0.83 | 70 | No | No |
deepseek-ai/DeepSeek-R1 | novita | live | 0.7 | 2.5 | 64000 | 0.73 | 28 | Yes | No |
deepseek-ai/DeepSeek-R1 | hyperbolic | live | 2 | 2 | 163840 | 0.95 | 37 | No | No |
deepseek-ai/DeepSeek-R1 | together | live | 3 | 7 | 163840 | 0.86 | 49 | No | Yes |
deepseek-ai/DeepSeek-R1 | sambanova | live | - | - | - | 1.79 | 142 | Yes | Yes |
Qwen/Qwen3-30B-A3B-Instruct-2507 | nebius | live | 0.1 | 0.3 | 262144 | 0.29 | 109 | Yes | Yes |
Qwen/Qwen3-Next-80B-A3B-Instruct | novita | live | 0.15 | 1.5 | 131072 | 0.71 | 114 | Yes | No |
Qwen/Qwen3-Next-80B-A3B-Instruct | hyperbolic | live | 0.3 | 0.3 | 262144 | 0.49 | 161 | Yes | No |
Qwen/Qwen3-Next-80B-A3B-Instruct | together | live | 0.15 | 1.5 | 262144 | 0.92 | 141 | Yes | Yes |
Kwaipilot/KAT-Dev | novita | live | 0.15 | 0.4 | 65536 | 1.13 | 38 | Yes | Yes |
Qwen/Qwen2.5-VL-7B-Instruct | hyperbolic | live | 0.2 | 0.2 | 32768 | 0.42 | 44 | No | No |
Qwen/Qwen3-VL-235B-A22B-Instruct | novita | live | 0.3 | 1.5 | 131072 | 1.74 | 37 | Yes | Yes |
meta-llama/Llama-3.2-1B-Instruct | novita | live | - | - | 131000 | 0.64 | 180 | No | No |
meta-llama/Llama-3.2-1B-Instruct | sambanova | offline | - | - | - | 0.44 | - | - | - |
moonshotai/Kimi-K2-Instruct-0905 | novita | live | 0.6 | 2.5 | 262144 | 0.51 | 43 | Yes | Yes |
moonshotai/Kimi-K2-Instruct-0905 | groq | live | - | - | 262144 | 0.61 | 161 | Yes | No |
moonshotai/Kimi-K2-Instruct-0905 | together | live | 1 | 3 | 262144 | 0.57 | 48 | Yes | Yes |
mistralai/Mistral-7B-Instruct-v0.3 | novita | offline | 0.029 | 0.059 | 32768 | 0.83 | 118 | No | No |
mistralai/Mistral-7B-Instruct-v0.3 | together | live | 0.2 | 0.2 | 32768 | 0.61 | 160 | No | Yes |
Qwen/Qwen3-4B-Thinking-2507 | nscale | live | 0.01 | 0.03 | 262144 | 0.69 | 27 | Yes | No |
Qwen/Qwen3-VL-235B-A22B-Thinking | novita | live | 0.98 | 3.95 | 131072 | 3.36 | 41 | No | No |
zai-org/GLM-4.6-FP8 | zai-org | live | - | - | - | 1.05 | 68 | Yes | No |
meta-llama/Llama-3.2-3B-Instruct | novita | live | 0.03 | 0.05 | 32768 | 0.72 | 122 | Yes | No |
meta-llama/Llama-3.2-3B-Instruct | hyperbolic | live | 0.1 | 0.1 | 131072 | 1.30 | 110 | No | No |
meta-llama/Llama-3.2-3B-Instruct | together | live | 0.06 | 0.06 | 131072 | 0.66 | 144 | Yes | Yes |
meta-llama/Llama-3.2-3B-Instruct | sambanova | offline | - | - | - | 0.38 | - | - | - |
meta-llama/Llama-3.3-70B-Instruct | fireworks-ai | live | 0.9 | 0.9 | 131072 | 1.18 | 120 | No | No |
meta-llama/Llama-3.3-70B-Instruct | cerebras | live | 0.85 | 1.2 | - | 0.32 | 263 | Yes | No |
meta-llama/Llama-3.3-70B-Instruct | novita | live | 0.13 | 0.39 | 131072 | 0.48 | 26 | Yes | No |
meta-llama/Llama-3.3-70B-Instruct | nebius | live | 0.25 | 0.75 | 131072 | 0.35 | 109 | Yes | Yes |
meta-llama/Llama-3.3-70B-Instruct | nscale | live | 0.4 | 0.4 | 131072 | 0.54 | 17 | No | Yes |
meta-llama/Llama-3.3-70B-Instruct | groq | live | 0.59 | 0.79 | 131072 | 0.23 | 368 | Yes | No |
meta-llama/Llama-3.3-70B-Instruct | hyperbolic | live | 0.4 | 0.4 | 131072 | 0.60 | 106 | No | No |
meta-llama/Llama-3.3-70B-Instruct | together | live | 0.88 | 0.88 | 131072 | 0.40 | 118 | Yes | Yes |
meta-llama/Llama-3.3-70B-Instruct | sambanova | live | 0.6 | 1.2 | 131072 | 0.36 | 310 | Yes | Yes |
meta-llama/Llama-3.3-70B-Instruct | scaleway | live | - | - | - | 0.55 | 24 | Yes | Yes |
zai-org/GLM-4.5-Air | fireworks-ai | offline | 0.22 | 0.88 | 131072 | 0.76 | 106 | Yes | No |
zai-org/GLM-4.5-Air | nebius | live | 0.2 | 1.2 | 131072 | 0.37 | 70 | Yes | Yes |
zai-org/GLM-4.5-Air | zai-org | live | - | - | - | 1.14 | 68 | Yes | No |
meta-llama/Meta-Llama-3-8B-Instruct | novita | live | 0.04 | 0.04 | 8192 | 0.74 | 70 | No | No |
meta-llama/Meta-Llama-3-8B-Instruct | groq | offline | 0.05 | 0.08 | 8192 | 0.19 | 1056 | Yes | No |
meta-llama/Meta-Llama-3-8B-Instruct | together | offline | - | - | - | - | - | - | - |
meta-llama/Llama-4-Scout-17B-16E-Instruct | fireworks-ai | live | 0.15 | 0.6 | 1048576 | 0.55 | 57 | Yes | No |
meta-llama/Llama-4-Scout-17B-16E-Instruct | cerebras | live | 0.65 | 0.85 | - | 0.25 | 530 | Yes | No |
meta-llama/Llama-4-Scout-17B-16E-Instruct | novita | live | 0.1 | 0.5 | 131072 | 0.55 | 36 | Yes | No |
meta-llama/Llama-4-Scout-17B-16E-Instruct | nscale | live | 0.09 | 0.29 | 890000 | 1.16 | 33 | Yes | Yes |
meta-llama/Llama-4-Scout-17B-16E-Instruct | groq | live | 0.11 | 0.34 | 131072 | 0.20 | 362 | Yes | No |
meta-llama/Llama-4-Scout-17B-16E-Instruct | together | live | 0.18 | 0.59 | 1048576 | 0.28 | 66 | Yes | Yes |
meta-llama/Llama-4-Scout-17B-16E-Instruct | sambanova | offline | - | - | - | 0.86 | - | - | - |
Qwen/Qwen3-Coder-480B-A35B-Instruct | fireworks-ai | live | 0.45 | 1.8 | 262144 | 1.15 | 58 | Yes | No |
Qwen/Qwen3-Coder-480B-A35B-Instruct | cerebras | live | 2 | 2 | - | 0.32 | 612 | Yes | No |
Qwen/Qwen3-Coder-480B-A35B-Instruct | novita | live | 0.29 | 1.2 | 262144 | 0.75 | 59 | Yes | Yes |
Qwen/Qwen3-Coder-480B-A35B-Instruct | nebius | live | 0.4 | 1.8 | 262144 | 0.77 | 63 | Yes | Yes |
Qwen/Qwen3-Coder-480B-A35B-Instruct | hyperbolic | live | 2 | 2 | 262144 | 1.42 | 48 | Yes | No |
Qwen/Qwen3-Coder-480B-A35B-Instruct | together | live | 2 | 2 | 262144 | 0.57 | 52 | Yes | Yes |
Qwen/Qwen3-32B | cerebras | live | 0.4 | 0.8 | - | 0.30 | 721 | No | No |
Qwen/Qwen3-32B | novita | live | 0.1 | 0.45 | 40960 | 0.71 | 51 | No | No |
Qwen/Qwen3-32B | nebius | live | 0.1 | 0.3 | 40960 | 0.33 | 45 | Yes | No |
Qwen/Qwen3-32B | nscale | live | 0.08 | 0.25 | 40960 | 0.99 | 25 | Yes | Yes |
Qwen/Qwen3-32B | groq | live | 0.29 | 0.59 | 131072 | 0.19 | 251 | Yes | No |
Qwen/Qwen3-32B | sambanova | live | 0.4 | 0.8 | 32768 | 0.91 | 239 | Yes | Yes |
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | nscale | live | 0.1 | 0.1 | 131072 | 0.49 | 134 | No | No |
Qwen/Qwen2.5-7B-Instruct | together | live | 0.3 | 0.3 | 32768 | 0.22 | 145 | Yes | Yes |
swiss-ai/Apertus-8B-Instruct-2509 | publicai | live | - | - | - | 1.18 | 92 | No | Yes |
Qwen/Qwen3-14B | nebius | live | 0.08 | 0.24 | 40960 | 0.71 | 81 | Yes | Yes |
Qwen/Qwen3-14B | nscale | live | 0.07 | 0.2 | 40960 | 0.91 | 35 | Yes | No |
deepseek-ai/DeepSeek-V3.1-Terminus | novita | live | 0.27 | 1 | 131072 | 1.48 | 59 | Yes | Yes |
HuggingFaceTB/SmolLM3-3B | hf-inference | live | - | - | - | 0.18 | 87 | Yes | Yes |
zai-org/GLM-4.5 | fireworks-ai | live | 0.55 | 2.19 | 131072 | 1.20 | 69 | Yes | No |
zai-org/GLM-4.5 | novita | live | 0.6 | 2.2 | 131072 | 0.93 | 54 | Yes | No |
zai-org/GLM-4.5 | nebius | live | 0.6 | 2.2 | 131072 | 0.30 | 39 | Yes | Yes |
zai-org/GLM-4.5 | zai-org | live | - | - | - | 0.96 | 52 | Yes | No |
google/gemma-3-27b-it | nebius | live | 0.2 | 0.6 | 110000 | 0.42 | 77 | No | Yes |
google/gemma-3-27b-it | scaleway | live | - | - | - | 1.06 | 42 | Yes | No |
moonshotai/Kimi-K2-Instruct | fireworks-ai | live | 0.6 | 2.5 | 131072 | 2.33 | 47 | Yes | No |
moonshotai/Kimi-K2-Instruct | novita | live | 0.57 | 2.3 | 131072 | 0.87 | 55 | Yes | Yes |
moonshotai/Kimi-K2-Instruct | nebius | live | 0.5 | 2.4 | 131072 | 0.39 | 50 | Yes | Yes |
moonshotai/Kimi-K2-Instruct | together | live | 1 | 3 | 131072 | 0.96 | 32 | Yes | Yes |
Qwen/Qwen3-30B-A3B-Thinking-2507 | nebius | live | 0.1 | 0.3 | 262144 | 0.78 | 108 | Yes | Yes |
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | novita | live | 0.3 | 0.3 | 64000 | 0.90 | 35 | No | Yes |
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | nscale | live | 0.3 | 0.3 | 131072 | 0.72 | 26 | No | Yes |
google/gemma-2-2b-it | nebius | live | 0.02 | 0.06 | 8192 | 0.31 | 78 | No | Yes |
deepseek-ai/DeepSeek-R1-0528 | fireworks-ai | live | 3 | 8 | 163840 | 1.12 | 37 | No | No |
deepseek-ai/DeepSeek-R1-0528 | novita | live | 0.7 | 2.5 | 163840 | 0.72 | 28 | Yes | No |
deepseek-ai/DeepSeek-R1-0528 | nebius | live | 0.8 | 2.4 | 163840 | 1.13 | 31 | Yes | Yes |
deepseek-ai/DeepSeek-R1-0528 | hyperbolic | live | 3 | 3 | 163840 | 0.67 | 48 | No | No |
deepseek-ai/DeepSeek-R1-0528 | together | live | 3 | 7 | 163840 | 0.54 | 51 | No | Yes |
deepseek-ai/DeepSeek-R1-0528 | sambanova | live | 5 | 7 | 131072 | 0.51 | 211 | Yes | Yes |
meta-llama/Llama-4-Maverick-17B-128E-Instruct | fireworks-ai | live | 0.22 | 0.88 | 1048576 | 1.27 | 73 | Yes | No |
meta-llama/Llama-4-Maverick-17B-128E-Instruct | cerebras | live | 0.2 | 0.6 | - | 0.28 | 758 | Yes | No |
meta-llama/Llama-4-Maverick-17B-128E-Instruct | groq | live | 0.2 | 0.6 | 131072 | 0.20 | 494 | Yes | No |
meta-llama/Llama-4-Maverick-17B-128E-Instruct | sambanova | live | 0.63 | 1.8 | 131072 | 1.72 | 339 | Yes | Yes |
Qwen/Qwen3-235B-A22B-Thinking-2507 | fireworks-ai | live | 0.22 | 0.88 | 262144 | 1.21 | 40 | Yes | No |
Qwen/Qwen3-235B-A22B-Thinking-2507 | cerebras | live | 0.6 | 1.2 | - | 0.27 | 599 | No | No |
Qwen/Qwen3-235B-A22B-Thinking-2507 | novita | live | 0.3 | 3 | 131072 | 1.21 | 39 | Yes | No |
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B | novita | live | 0.06 | 0.09 | 128000 | 1.07 | 68 | No | No |
katanemo/Arch-Router-1.5B | hf-inference | live | - | - | - | 0.18 | 74 | No | Yes |
deepseek-ai/DeepSeek-R1-Distill-Llama-8B | novita | offline | 0.04 | 0.04 | 32000 | 0.63 | 52 | No | Yes |
deepseek-ai/DeepSeek-R1-Distill-Llama-8B | nscale | live | 0.05 | 0.05 | 131072 | 0.50 | 55 | No | Yes |
deepseek-ai/DeepSeek-V3 | fireworks-ai | offline | 0.9 | 0.9 | 131072 | 0.82 | 85 | Yes | No |
deepseek-ai/DeepSeek-V3 | novita | live | 0.4 | 1.3 | 64000 | 1.07 | 37 | Yes | No |
deepseek-ai/DeepSeek-V3 | nebius | live | 0.5 | 1.5 | 163840 | 0.50 | 25 | No | Yes |
deepseek-ai/DeepSeek-V3 | together | live | 1.25 | 1.25 | 131072 | 0.58 | 57 | Yes | Yes |
Qwen/Qwen3-235B-A22B-Instruct-2507 | fireworks-ai | live | 0.22 | 0.88 | 262144 | 1.04 | 44 | Yes | No |
Qwen/Qwen3-235B-A22B-Instruct-2507 | cerebras | live | 0.6 | 1.2 | - | 0.23 | 529 | Yes | No |
Qwen/Qwen3-235B-A22B-Instruct-2507 | novita | live | 0.09 | 0.58 | 131072 | 0.89 | 38 | Yes | Yes |
Qwen/Qwen3-235B-A22B-Instruct-2507 | nebius | live | 0.2 | 0.6 | 262144 | 0.75 | 61 | Yes | Yes |
Qwen/Qwen3-235B-A22B-Instruct-2507 | nscale | live | 0.2 | 0.6 | 32768 | 0.70 | 24 | Yes | Yes |
Qwen/Qwen3-235B-A22B-Instruct-2507 | hyperbolic | live | 2 | 2 | 262144 | 0.68 | 68 | Yes | No |
Qwen/Qwen3-235B-A22B-Instruct-2507 | together | live | 0.2 | 0.6 | 262144 | 0.46 | 56 | Yes | Yes |
Qwen/Qwen3-235B-A22B-Instruct-2507 | scaleway | live | - | - | - | 0.67 | 71 | Yes | Yes |
Qwen/Qwen3-Next-80B-A3B-Thinking | novita | live | 0.15 | 1.5 | 131072 | 0.94 | 142 | Yes | No |
Qwen/Qwen3-Next-80B-A3B-Thinking | hyperbolic | live | 0.3 | 0.3 | 262144 | 0.63 | 177 | Yes | No |
Qwen/Qwen3-Next-80B-A3B-Thinking | together | live | 0.15 | 1.5 | 262144 | 0.93 | 152 | Yes | Yes |
zai-org/GLM-4.5V | novita | live | 0.6 | 1.8 | 65536 | 1.10 | 64 | Yes | No |
zai-org/GLM-4.5V | zai-org | live | - | - | - | 1.07 | 80 | Yes | No |
deepseek-ai/DeepSeek-V3.1 | fireworks-ai | live | - | - | 163840 | 1.08 | 64 | Yes | No |
deepseek-ai/DeepSeek-V3.1 | novita | live | 0.27 | 1 | 131072 | 0.83 | 57 | Yes | Yes |
deepseek-ai/DeepSeek-V3.1 | together | live | 0.6 | 1.7 | 131072 | 0.92 | 114 | Yes | No |
meta-llama/Llama-3.1-70B-Instruct | fireworks-ai | live | 0.9 | 0.9 | 131072 | 0.53 | 109 | No | No |
Qwen/QwQ-32B | fireworks-ai | offline | - | - | - | 0.56 | - | No | No |
Qwen/QwQ-32B | nebius | live | 0.5 | 1.5 | 131072 | 0.37 | 80 | No | Yes |
Qwen/QwQ-32B | nscale | live | 0.18 | 0.2 | 131072 | 0.63 | 24 | Yes | Yes |
Qwen/QwQ-32B | groq | offline | - | - | - | - | - | - | - |
Qwen/QwQ-32B | hyperbolic | live | 0.4 | 0.4 | 131072 | 0.62 | 90 | No | No |
Qwen/QwQ-32B | sambanova | offline | - | - | - | 0.41 | - | - | - |
Qwen/Qwen2.5-VL-72B-Instruct | nebius | live | 0.25 | 0.75 | 32000 | 0.43 | 33 | No | Yes |
Qwen/Qwen2.5-VL-72B-Instruct | hyperbolic | live | 0.6 | 0.6 | 32768 | 0.90 | 41 | No | No |
zai-org/GLM-4.5-Air-FP8 | together | live | 0.2 | 1.1 | 131072 | 0.67 | 110 | Yes | Yes |
deepseek-ai/DeepSeek-V3-0324 | fireworks-ai | live | 0.9 | 0.9 | 163840 | 1.62 | 62 | Yes | No |
deepseek-ai/DeepSeek-V3-0324 | novita | live | 0.27 | 1.12 | 163840 | 0.79 | 30 | Yes | Yes |
deepseek-ai/DeepSeek-V3-0324 | nebius | live | 0.75 | 2.25 | 32768 | 1.22 | 125 | No | No |
deepseek-ai/DeepSeek-V3-0324 | hyperbolic | live | 1.25 | 1.25 | 163840 | 1.95 | 36 | Yes | No |
deepseek-ai/DeepSeek-V3-0324 | together | live | 1.25 | 1.25 | 131072 | 0.72 | 50 | Yes | Yes |
deepseek-ai/DeepSeek-V3-0324 | sambanova | live | 3 | 4.5 | 131072 | 0.61 | 173 | Yes | Yes |
zai-org/GLM-4.5V-FP8 | zai-org | live | - | - | - | 1.54 | 74 | Yes | No |
swiss-ai/Apertus-70B-Instruct-2509 | publicai | live | - | - | - | 1.33 | 45 | No | Yes |
Qwen/Qwen2.5-Coder-7B-Instruct | nscale | live | 0.01 | 0.03 | 131072 | 0.55 | 58 | No | Yes |
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | nscale | live | 0.15 | 0.15 | 131072 | 0.47 | 68 | No | Yes |
CohereLabs/aya-expanse-8b | cohere | live | - | - | - | 0.22 | 72 | No | No |
Qwen/Qwen3-30B-A3B | fireworks-ai | live | 0.15 | 0.6 | 131072 | 2.36 | 114 | Yes | No |
Qwen/Qwen3-30B-A3B | novita | live | 0.09 | 0.45 | 40960 | 0.78 | 78 | No | No |
Qwen/Qwen3-30B-A3B | nebius | live | 0.1 | 0.3 | 40960 | 0.84 | 78 | Yes | Yes |
NousResearch/Hermes-4-70B | nebius | live | 0.13 | 0.4 | 131072 | 0.40 | 86 | No | No |
Sao10K/L3-70B-Euryale-v2.1 | novita | live | 1.48 | 1.48 | 8192 | 1.37 | 55 | No | No |
CohereLabs/aya-expanse-32b | cohere | live | - | - | - | 0.94 | 43 | No | No |
CohereLabs/command-a-translate-08-2025 | cohere | live | - | - | - | 0.24 | 65 | Yes | No |
zai-org/GLM-4.1V-9B-Thinking | novita | live | 0.035 | 0.138 | 65536 | 0.84 | 101 | No | No |
Qwen/Qwen3-235B-A22B | fireworks-ai | live | 0.22 | 0.88 | 131072 | 2.19 | 39 | Yes | No |
Qwen/Qwen3-235B-A22B | novita | live | 0.2 | 0.8 | 40960 | 1.28 | 14 | No | No |
Qwen/Qwen3-235B-A22B | nscale | live | 0.2 | 0.6 | 32000 | 0.76 | 24 | Yes | Yes |
Qwen/Qwen3-235B-A22B | together | live | 0.2 | 0.6 | 40960 | 0.40 | 38 | Yes | Yes |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | novita | live | 0.8 | 0.8 | 32000 | 2.13 | 60 | No | Yes |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | nscale | live | 0.75 | 0.75 | 131072 | 0.72 | 16 | No | No |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | groq | offline | 0.75 | 0.99 | 131072 | 0.65 | 200 | Yes | No |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | sambanova | live | 0.7 | 1.4 | 131072 | 1.19 | 178 | No | No |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | scaleway | live | - | - | - | 0.65 | 25 | No | Yes |
NousResearch/Hermes-4-405B | nebius | live | 1 | 3 | 131072 | 0.55 | 36 | No | No |
tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4 | sambanova | live | 0.6 | 1.2 | 131072 | 3.11 | 119 | No | Yes |
deepcogito/cogito-v2-preview-llama-70B | together | live | 0.88 | 0.88 | 32768 | 1.74 | 49 | Yes | Yes |
meta-llama/Llama-3.1-405B-Instruct | fireworks-ai | live | 3 | 3 | 131072 | 3.49 | 52 | Yes | No |
meta-llama/Llama-3.1-405B-Instruct | nebius | live | 1 | 3 | 131072 | 0.34 | 30 | Yes | Yes |
meta-llama/Llama-3.1-405B-Instruct | sambanova | offline | - | - | - | 0.47 | 112 | Yes | Yes |
Qwen/Qwen2.5-Coder-7B | nebius | live | 0.03 | 0.09 | 32768 | 0.42 | 210 | No | Yes |
arcee-ai/AFM-4.5B | together | live | 0.048 | 0.15 | 65536 | 1.05 | 172 | No | Yes |
Qwen/Qwen2.5-72B-Instruct | fireworks-ai | offline | - | - | - | 0.40 | - | Yes | No |
Qwen/Qwen2.5-72B-Instruct | novita | live | 0.38 | 0.4 | 32000 | 1.71 | 44 | Yes | No |
Qwen/Qwen2.5-72B-Instruct | nebius | live | 0.13 | 0.4 | 131072 | 0.78 | 29 | Yes | No |
Qwen/Qwen2.5-72B-Instruct | hyperbolic | live | 0.4 | 0.4 | 131072 | 2.76 | 30 | No | No |
Qwen/Qwen2.5-72B-Instruct | together | live | 1.2 | 1.2 | 131072 | 0.34 | 59 | Yes | Yes |
meta-llama/Llama-Guard-4-12B | groq | live | 0.2 | 0.2 | 131072 | 0.25 | 7 | No | No |
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | together | live | 2 | 2 | 262144 | 0.65 | 53 | Yes | Yes |
Qwen/Qwen2.5-VL-32B-Instruct | fireworks-ai | live | 0.22 | 0.88 | 128000 | 0.57 | 43 | No | No |
mistralai/Mistral-Small-24B-Instruct-2501 | together | live | 0.8 | 0.8 | 32768 | 0.24 | 96 | Yes | Yes |
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | novita | live | 0.17 | 0.85 | 1048576 | 0.99 | 62 | Yes | No |
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | together | live | 0.27 | 0.85 | 1048576 | 0.36 | 50 | Yes | Yes |
marin-community/marin-8b-instruct | together | live | 0.18 | 0.18 | 4096 | 0.24 | 172 | No | Yes |
MiniMaxAI/MiniMax-M1-80k | novita | live | 0.55 | 2.2 | 1000000 | 1.28 | 38 | No | No |
aisingapore/Gemma-SEA-LION-v4-27B-IT | publicai | live | - | - | - | 1.76 | 50 | No | Yes |
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | novita | live | 0.15 | 0.15 | 32768 | 0.75 | 74 | No | No |
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | nscale | live | 0.2 | 0.2 | 131072 | 0.43 | 36 | No | Yes |
baichuan-inc/Baichuan-M2-32B | novita | live | 0.07 | 0.07 | 131072 | 1.53 | 40 | No | Yes |
Qwen/Qwen3-235B-A22B-FP8 | together | live | 0.2 | 0.6 | 40960 | 0.48 | 37 | Yes | Yes |
deepseek-ai/DeepSeek-Prover-V2-671B | novita | live | 0.7 | 2.5 | 160000 | 1.77 | 22 | No | No |
baidu/ERNIE-4.5-300B-A47B-Base-PT | novita | live | 0.28 | 1.1 | 123000 | 1.01 | 26 | No | Yes |
CohereLabs/command-a-vision-07-2025 | cohere | live | - | - | - | 0.63 | 62 | No | No |
baidu/ERNIE-4.5-VL-28B-A3B-PT | novita | live | 0.14 | 0.56 | 30000 | 1.04 | 77 | No | No |
mistralai/Mixtral-8x22B-Instruct-v0.1 | fireworks-ai | live | 1.2 | 1.2 | 65536 | 0.75 | 68 | No | No |
mistralai/Mixtral-8x22B-Instruct-v0.1 | nscale | live | 1.2 | 1.2 | 65536 | 0.60 | 20 | No | Yes |
mistralai/Mixtral-8x22B-Instruct-v0.1 | together | offline | - | - | - | - | - | - | - |
Qwen/Qwen2.5-Coder-3B-Instruct | nscale | live | 0.01 | 0.03 | 32768 | 0.91 | 69 | No | Yes |
baidu/ERNIE-4.5-21B-A3B-PT | novita | live | 0.07 | 0.28 | 120000 | 1.28 | 79 | No | No |
Qwen/Qwen2.5-Coder-32B-Instruct | nscale | live | 0.06 | 0.2 | 131072 | 0.43 | 27 | No | Yes |
Qwen/Qwen2.5-Coder-32B-Instruct | hyperbolic | live | 0.2 | 0.2 | 32768 | 1.64 | 83 | No | No |
Qwen/Qwen2.5-Coder-32B-Instruct | together | live | 0.8 | 0.8 | 16384 | 0.55 | 93 | Yes | Yes |
Qwen/Qwen2.5-Coder-32B-Instruct | scaleway | live | - | - | - | 0.54 | 40 | Yes | No |
CohereLabs/command-a-reasoning-08-2025 | cohere | live | - | - | - | 0.21 | 52 | Yes | No |
CohereLabs/c4ai-command-r7b-arabic-02-2025 | cohere | live | - | - | - | 0.32 | 74 | Yes | No |
deepcogito/cogito-v2-preview-deepseek-671B-MoE | together | live | 1.25 | 1.25 | 163840 | 0.58 | 48 | No | Yes |
deepcogito/cogito-v2-preview-llama-405B | together | live | 3.5 | 3.5 | 32768 | 0.86 | 27 | Yes | Yes |
deepcogito/cogito-v2-preview-llama-109B-MoE | together | live | 0.18 | 0.59 | 32767 | 0.58 | 84 | Yes | Yes |
baidu/ERNIE-4.5-VL-424B-A47B-Base-PT | novita | live | 0.42 | 1.25 | 123000 | 1.75 | 33 | No | No |
CohereLabs/aya-vision-8b | cohere | live | - | - | - | 2.23 | 58 | No | No |
CohereLabs/aya-vision-32b | cohere | live | - | - | - | 0.39 | 56 | No | No |
baidu/ERNIE-4.5-0.3B-PT | novita | live | - | - | 120000 | 1.88 | 95 | No | No |
CohereLabs/c4ai-command-r-08-2024 | cohere | live | - | - | - | 0.22 | 53 | Yes | No |
CohereLabs/c4ai-command-r7b-12-2024 | cohere | live | - | - | - | 0.23 | 67 | Yes | No |
CohereLabs/c4ai-command-a-03-2025 | cohere | live | - | - | - | 0.30 | 72 | Yes | No |
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 | nebius | live | 0.6 | 1.8 | 131072 | 0.43 | 36 | No | Yes |
NousResearch/Hermes-3-Llama-3.1-405B | nebius | live | 1 | 3 | 131072 | 0.73 | 24 | No | Yes |
Sao10K/L3-8B-Stheno-v3.2 | novita | live | 0.05 | 0.05 | 8192 | 0.78 | 97 | No | No |
google/gemma-2-9b-it | nebius | live | 0.03 | 0.09 | 8192 | 0.62 | 119 | No | Yes |
google/gemma-2-9b-it | groq | offline | 0.2 | 0.2 | 8192 | 0.34 | 435 | Yes | No |
meta-llama/Meta-Llama-3-70B-Instruct | novita | live | 0.51 | 0.74 | 8192 | 0.95 | 20 | No | Yes |
meta-llama/Meta-Llama-3-70B-Instruct | groq | offline | 0.59 | 0.79 | 8192 | 0.17 | 296 | Yes | No |
meta-llama/Meta-Llama-3-70B-Instruct | hyperbolic | live | 0.4 | 0.4 | 8192 | 0.63 | 95 | No | No |
meta-llama/Meta-Llama-3-70B-Instruct | together | live | 0.88 | 0.88 | 8192 | 0.39 | 107 | No | Yes |
NousResearch/Hermes-3-Llama-3.1-70B | hyperbolic | live | 0.4 | 0.4 | 12288 | 0.28 | 30 | No | No |
alpindale/WizardLM-2-8x22B | novita | live | 0.62 | 0.62 | 65535 | 1.64 | 29 | No | No |
SentientAGI/Dobby-Unhinged-Llama-3.3-70B | fireworks-ai | live | 0.9 | 0.9 | 131072 | 0.36 | 52 | No | No |
mistralai/Mixtral-8x7B-Instruct-v0.1 | together | live | 0.6 | 0.6 | 32768 | 0.26 | 66 | No | Yes |
NousResearch/Hermes-2-Pro-Llama-3-8B | novita | live | 0.14 | 0.14 | 8192 | 0.71 | 109 | No | No |
Sao10K/L3-8B-Lunaris-v1 | novita | live | 0.05 | 0.05 | 8192 | 2.43 | 40 | No | No |
Qwen/QwQ-32B-Preview | fireworks-ai | offline | - | - | - | - | - | - | - |
Qwen/QwQ-32B-Preview | hyperbolic | offline | - | - | - | - | - | - | - |
Qwen/QwQ-32B-Preview | together | live | - | - | - | 0.74 | 90 | Yes | Yes |
Qwen/QwQ-32B-Preview | sambanova | offline | - | - | - | 0.35 | - | - | - |
zai-org/GLM-4-32B-0414 | novita | live | 0.55 | 1.66 | 32000 | 1.27 | 36 | No | Yes |