All AI Models

511 models across 51 providers. Click any model for detailed pricing, benchmarks, and speed data.

OpenAI

61 models
GPT-5.5 (xhigh)
$11.2560 t/s
GPT-5.5 (high)
$11.2563 t/s
GPT-5.4 (xhigh)
$5.6379 t/s
GPT-5.5 (medium)
$11.2559 t/s
GPT-5.3 Codex (xhigh)
$4.8173 t/s
GPT-5.2 (xhigh)
$4.8167 t/s
GPT-5.5 (low)
$11.2558 t/s
GPT-5.2 Codex (xhigh)
$4.81102 t/s
GPT-5.4 mini (xhigh)
$1.69157 t/s
GPT-5.4 (low)
$5.6363 t/s
GPT-5.1 (high)
$3.44120 t/s
GPT-5.2 (medium)
$4.810 t/s
GPT-5 (high)
$3.4489 t/s
GPT-5 Codex (high)
$3.44166 t/s
GPT-5.4 nano (xhigh)
$0.46155 t/s
GPT-5.1 Codex (high)
$3.44170 t/s
GPT-5 (medium)
$3.4479 t/s
GPT-5 mini (high)
$0.6998 t/s
GPT-5.5 (Non-reasoning)
$11.2558 t/s
o3-pro
$35.0022 t/s
GPT-5 (low)
$3.4464 t/s
GPT-5 mini (medium)
$0.69110 t/s
GPT-5.1 Codex mini (high)
$0.69177 t/s
o3
$3.5086 t/s
GPT-5.4 nano (medium)
$0.46150 t/s
GPT-5.4 mini (medium)
$1.69150 t/s
GPT-5.4 (Non-reasoning)
$5.6362 t/s
GPT-5.2 (Non-reasoning)
$4.8160 t/s
gpt-oss-120b (high)
$0.26272 t/s
o4-mini (high)
$1.93169 t/s
o1
$26.2592 t/s
GPT-5.1 (Non-reasoning)
$3.44104 t/s
GPT-5 nano (high)
$0.14164 t/s
GPT-4.1
$3.50101 t/s
GPT-5 nano (medium)
$0.14151 t/s
o3-mini
$1.93162 t/s
o1-pro
$262.500 t/s
o3-mini (high)
$1.93162 t/s
gpt-oss-120b (low)
$0.26276 t/s
gpt-oss-20B (high)
$0.09270 t/s
GPT-5.4 nano (Non-Reasoning)
$0.46154 t/s
GPT-5 (minimal)
$3.4466 t/s
o1-preview
$28.880 t/s
GPT-5.4 mini (Non-Reasoning)
$1.69154 t/s
GPT-4.1 mini
$0.7073 t/s
GPT-5 (ChatGPT)
$3.44170 t/s
gpt-oss-20B (low)
$0.10264 t/s
GPT-5 mini (minimal)
$0.69117 t/s
o1-mini
$0.000 t/s
GPT-4.5 (Preview)
$0.000 t/s
GPT-4o (Aug '24)
$4.3895 t/s
GPT-4o (March 2025, chatgpt-4o-latest)
$0.000 t/s
GPT-4o (Nov '24)
$4.38149 t/s
GPT-4o (May '24)
$7.50107 t/s
GPT-4o (ChatGPT)
$0.000 t/s
GPT-5 nano (minimal)
$0.14154 t/s
GPT-4 Turbo
$15.0032 t/s
GPT-4.1 nano
$0.17148 t/s
GPT-4
$37.5029 t/s
GPT-4o mini
$0.2667 t/s
GPT-3.5 Turbo
$0.75104 t/s

Anthropic

30 models

Google

51 models
Gemini 3.1 Pro Preview
$4.50135 t/s
Gemini 3.5 Flash (high)
$3.38227 t/s
Gemini 3 Pro Preview (high)
$4.50141 t/s
Gemini 3 Flash Preview (Reasoning)
$1.13196 t/s
Gemini 3 Pro Preview (low)
$4.500 t/s
Gemma 4 31B (Reasoning)
$0.0036 t/s
Gemini 3 Flash Preview (Non-reasoning)
$1.13186 t/s
Gemini 2.5 Pro
$3.44125 t/s
Gemini 3.1 Flash-Lite Preview
$0.56288 t/s
Gemma 4 31B (Non-reasoning)
$0.2029 t/s
Gemma 4 26B A4B (Reasoning)
$0.200 t/s
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)
$0.000 t/s
Gemini 2.5 Pro Preview (Mar' 25)
$0.000 t/s
Gemini 2.5 Pro Preview (May' 25)
$3.440 t/s
Gemma 4 26B A4B (Non-reasoning)
$0.2065 t/s
Gemini 2.5 Flash (Reasoning)
$0.85218 t/s
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash Preview (Reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.170 t/s
Gemini 2.5 Flash (Non-reasoning)
$0.85193 t/s
Gemini 2.0 Flash Thinking Experimental (Jan '25)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.170 t/s
Gemma 4 E4B (Reasoning)
$0.000 t/s
Gemini 2.0 Flash (Feb '25)
$0.260 t/s
Gemini 2.0 Pro Experimental (Feb '25)
$0.000 t/s
Gemini 2.5 Flash Preview (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite (Reasoning)
$0.17238 t/s
Gemini 2.0 Flash (experimental)
$0.000 t/s
Gemini 1.5 Pro (Sep '24)
$0.000 t/s
Gemma 4 E2B (Reasoning)
$0.000 t/s
Gemma 4 E4B (Non-reasoning)
$0.000 t/s
Gemini 2.0 Flash-Lite (Feb '25)
$0.000 t/s
Gemini 2.0 Flash-Lite (Preview)
$0.000 t/s
Gemini 1.5 Flash (Sep '24)
$0.000 t/s
Gemini 2.5 Flash-Lite (Non-reasoning)
$0.17203 t/s
Gemini 2.0 Flash Thinking Experimental (Dec '24)
$0.000 t/s
Gemma 4 E2B (Non-reasoning)
$0.000 t/s
Gemini 1.5 Pro (May '24)
$0.000 t/s
Gemini 1.5 Flash-8B
$0.000 t/s
Gemini 1.5 Flash (May '24)
$0.000 t/s
Gemma 3 27B Instruct
$0.140 t/s
Gemini 1.0 Ultra
$0.000 t/s
Gemma 3n E4B Instruct Preview (May '25)
$0.000 t/s
Gemma 3 12B Instruct
$0.140 t/s
PALM-2
$0.000 t/s
Gemini 1.0 Pro
$0.000 t/s
Gemma 3 270M
$0.000 t/s
Gemma 3n E4B Instruct
$0.0359 t/s
Gemma 3 4B Instruct
$0.050 t/s
Gemma 3 1B Instruct
$0.000 t/s
Gemma 3n E2B Instruct
$0.000 t/s

DeepSeek

31 models

Meta

17 models

Mistral

32 models

xAI

18 models

Alibaba / Qwen

79 models
Qwen3.6 Max Preview
$2.9237 t/s
Qwen3.6 Plus
$1.1353 t/s
Qwen3.6 27B (Reasoning)
$1.3563 t/s
Qwen3.5 397B A17B (Reasoning)
$1.3552 t/s
Qwen3.6 35B A3B (Reasoning)
$0.56176 t/s
Qwen3.5 27B (Reasoning)
$0.8290 t/s
Qwen3.5 122B A10B (Reasoning)
$1.10160 t/s
Qwen3.5 397B A17B (Non-reasoning)
$1.3553 t/s
Qwen3 Max Thinking
$2.4045 t/s
Qwen3.5 Omni Plus
$1.5056 t/s
Qwen3.5 27B (Non-reasoning)
$0.8390 t/s
Qwen3.6 27B (Non-reasoning)
$1.3559 t/s
Qwen3.5 35B A3B (Reasoning)
$0.69131 t/s
Qwen3.5 122B A10B (Non-reasoning)
$1.10152 t/s
Qwen3 Max Thinking (Preview)
$2.4044 t/s
Qwen3.5 9B (Reasoning)
$0.1158 t/s
Qwen3.6 35B A3B (Non-reasoning)
$0.84166 t/s
Qwen3 Max
$3.0533 t/s
Qwen3.5 35B A3B (Non-reasoning)
$0.69138 t/s
Qwen3 235B A22B 2507 (Reasoning)
$0.8459 t/s
Qwen3 Coder Next
$0.5681 t/s
Qwen3 VL 235B A22B (Reasoning)
$2.1736 t/s
Qwen3.5 9B (Non-reasoning)
$0.000 t/s
Qwen3.5 4B (Reasoning)
$0.06165 t/s
Qwen3 Next 80B A3B (Reasoning)
$1.88151 t/s
Qwen3 Max (Preview)
$2.4048 t/s
Qwen3.5 Omni Flash
$0.28232 t/s
Qwen3 235B A22B 2507 Instruct
$0.3664 t/s
Qwen3 Coder 480B A35B Instruct
$0.6865 t/s
Qwen3 VL 32B (Reasoning)
$2.6397 t/s
Qwen3.5 4B (Non-reasoning)
$0.06173 t/s
Qwen3 30B A3B 2507 (Reasoning)
$0.67144 t/s
Qwen3 VL 235B A22B Instruct
$0.7052 t/s
Qwen3 Next 80B A3B Instruct
$0.88158 t/s
Qwen3 Coder 30B A3B Instruct
$0.3598 t/s
Qwen3 235B A22B (Reasoning)
$2.6363 t/s
Qwen3 VL 30B A3B (Reasoning)
$0.34126 t/s
QwQ 32B
$0.7431 t/s
Qwen3 4B 2507 (Reasoning)
$0.000 t/s
Qwen3 VL 32B Instruct
$1.2355 t/s
Qwen3 235B A22B (Non-reasoning)
$0.7967 t/s
Qwen3 VL 8B (Reasoning)
$0.66135 t/s
Qwen3 32B (Reasoning)
$0.28104 t/s
Qwen3.5 2B (Reasoning)
$0.040 t/s
Qwen2.5 Max
$2.8051 t/s
Qwen3 14B (Reasoning)
$0.7364 t/s
Qwen3 VL 30B A3B Instruct
$0.30121 t/s
Qwen3 Omni 30B A3B (Reasoning)
$0.4399 t/s
Qwen2.5 Instruct 72B
$0.3755 t/s
Qwen3 30B A3B (Reasoning)
$0.1866 t/s
QwQ 32B-Preview
$0.000 t/s
Qwen3 30B A3B 2507 Instruct
$0.21125 t/s
Qwen3.5 2B (Non-reasoning)
$0.04318 t/s
Qwen3 32B (Non-reasoning)
$0.26102 t/s
Qwen3 VL 8B Instruct
$0.31147 t/s
Qwen3 4B (Reasoning)
$0.40105 t/s
Qwen3 VL 4B (Reasoning)
$0.000 t/s
Qwen3 8B (Reasoning)
$0.3790 t/s
Qwen2.5 Instruct 32B
$0.000 t/s
Qwen2.5 Coder Instruct 32B
$0.000 t/s
Qwen3 4B 2507 Instruct
$0.000 t/s
Qwen3 14B (Non-reasoning)
$0.3863 t/s
Qwen3 4B (Non-reasoning)
$0.19104 t/s
Qwen3 30B A3B (Non-reasoning)
$0.1370 t/s
Qwen2.5 Turbo
$0.0970 t/s
Qwen2 Instruct 72B
$0.000 t/s
Qwen3 Omni 30B A3B Instruct
$0.43108 t/s
Qwen3 8B (Non-reasoning)
$0.1889 t/s
Qwen3.5 0.8B (Reasoning)
$0.020 t/s
Qwen2.5 Coder Instruct 7B
$0.000 t/s
Qwen3.5 0.8B (Non-reasoning)
$0.02132 t/s
Qwen3 VL 4B Instruct
$0.000 t/s
Qwen1.5 Chat 110B
$0.000 t/s
Qwen Chat 72B
$0.000 t/s
Qwen3 1.7B (Reasoning)
$0.40139 t/s
Qwen Chat 14B
$0.000 t/s
Qwen3 1.7B (Non-reasoning)
$0.19139 t/s
Qwen3 0.6B (Reasoning)
$0.40225 t/s
Qwen3 0.6B (Non-reasoning)
$0.19223 t/s

NVIDIA

17 models

Amazon

14 models

Microsoft

4 models

Cohere

4 models

Moonshot

8 models

ai2

10 models

ai21-labs

7 models

arcee

1 models

baidu

2 models

bytedance_seed

2 models

china-mobile

2 models

databricks

1 models

deepcogito

1 models

ibm

10 models

inception

1 models

inclusionai

8 models

korea-telecom

2 models

kwaikat

2 models

lg

7 models

liquidai

8 models

longcat

1 models

mbzuai

4 models

minimax

6 models

motif-technologies

1 models

nanbeige

1 models

nous-research

7 models

openbmb

1 models

openchat

1 models

perplexity

5 models

prime-intellect

1 models

reka-ai

2 models

sarvam

3 models

servicenow

2 models

snowflake

1 models

stepfun

3 models

swiss-ai-initiative

2 models

tencent

2 models

tii-uae

1 models

trillionlabs

2 models

upstage

7 models

xiaomi

9 models

zai

18 models