All AI Models

454 models across 47 providers. Click any model for detailed pricing, benchmarks, and speed data.

OpenAI

55 models
GPT-5.4 (xhigh)
$5.6377 t/s
GPT-5.3 Codex (xhigh)
$4.8166 t/s
GPT-5.2 (xhigh)
$4.8171 t/s
GPT-5.2 Codex (xhigh)
$4.81101 t/s
GPT-5.4 mini (xhigh)
$1.69188 t/s
GPT-5.1 (high)
$3.4496 t/s
GPT-5.2 (medium)
$4.810 t/s
GPT-5 (high)
$3.4496 t/s
GPT-5 Codex (high)
$3.44166 t/s
GPT-5.4 nano (xhigh)
$0.46206 t/s
GPT-5.1 Codex (high)
$3.44168 t/s
GPT-5 (medium)
$3.4486 t/s
GPT-5 mini (high)
$0.6978 t/s
o3-pro
$35.0019 t/s
GPT-5 (low)
$3.4478 t/s
GPT-5 mini (medium)
$0.6982 t/s
GPT-5.1 Codex mini (high)
$0.69174 t/s
o3
$3.5086 t/s
GPT-5.4 nano (medium)
$0.46196 t/s
GPT-5.4 mini (medium)
$1.69183 t/s
GPT-5.4 (Non-reasoning)
$5.6365 t/s
GPT-5.2 (Non-reasoning)
$4.8170 t/s
gpt-oss-120B (high)
$0.26234 t/s
o4-mini (high)
$1.93126 t/s
o1
$26.25108 t/s
GPT-5.1 (Non-reasoning)
$3.4492 t/s
GPT-5 nano (high)
$0.14151 t/s
GPT-4.1
$3.5096 t/s
o3-mini
$1.93137 t/s
GPT-5 nano (medium)
$0.14142 t/s
o1-pro
$262.500 t/s
o3-mini (high)
$1.93143 t/s
gpt-oss-120B (low)
$0.26225 t/s
gpt-oss-20B (high)
$0.09262 t/s
GPT-5.4 nano (Non-Reasoning)
$0.46190 t/s
GPT-5 (minimal)
$3.4473 t/s
o1-preview
$28.880 t/s
GPT-5.4 mini (Non-Reasoning)
$1.69169 t/s
GPT-4.1 mini
$0.7073 t/s
GPT-5 (ChatGPT)
$3.44142 t/s
gpt-oss-20B (low)
$0.09248 t/s
GPT-5 mini (minimal)
$0.6979 t/s
o1-mini
$0.000 t/s
GPT-4.5 (Preview)
$0.000 t/s
GPT-4o (Aug '24)
$4.3889 t/s
GPT-4o (March 2025, chatgpt-4o-latest)
$0.000 t/s
GPT-4o (Nov '24)
$4.38107 t/s
GPT-4o (May '24)
$7.5088 t/s
GPT-4o (ChatGPT)
$0.000 t/s
GPT-5 nano (minimal)
$0.14127 t/s
GPT-4 Turbo
$15.0031 t/s
GPT-4.1 nano
$0.17141 t/s
GPT-4
$37.5031 t/s
GPT-4o mini
$0.2652 t/s
GPT-3.5 Turbo
$0.7591 t/s

Anthropic

28 models

Google

46 models
Gemini 3.1 Pro Preview
$4.50118 t/s
Gemini 3 Pro Preview (high)
$4.50128 t/s
Gemini 3 Flash Preview (Reasoning)
$1.13194 t/s
Gemini 3 Pro Preview (low)
$4.500 t/s
Gemma 4 31B (Reasoning)
$0.0036 t/s
Gemini 3 Flash Preview (Non-reasoning)
$1.13169 t/s
Gemini 2.5 Pro
$3.44130 t/s
Gemini 3.1 Flash-Lite Preview
$0.56205 t/s
Gemma 4 26B A4B (Reasoning)
$0.200 t/s
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)
$0.000 t/s
Gemini 2.5 Pro Preview (Mar' 25)
$0.000 t/s
Gemini 2.5 Pro Preview (May' 25)
$3.440 t/s
Gemini 2.5 Flash (Reasoning)
$0.85213 t/s
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash Preview (Reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.170 t/s
Gemini 2.5 Flash (Non-reasoning)
$0.85210 t/s
Gemini 2.0 Flash Thinking Experimental (Jan '25)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.170 t/s
Gemma 4 E4B
$0.000 t/s
Gemini 2.0 Flash (Feb '25)
$0.260 t/s
Gemini 2.0 Pro Experimental (Feb '25)
$0.000 t/s
Gemini 2.5 Flash Preview (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite (Reasoning)
$0.17322 t/s
Gemini 2.0 Flash (experimental)
$0.000 t/s
Gemini 1.5 Pro (Sep '24)
$0.000 t/s
Gemma 4 E2B
$0.000 t/s
Gemini 2.0 Flash-Lite (Feb '25)
$0.000 t/s
Gemini 2.0 Flash-Lite (Preview)
$0.000 t/s
Gemini 1.5 Flash (Sep '24)
$0.000 t/s
Gemini 2.5 Flash-Lite (Non-reasoning)
$0.17274 t/s
Gemini 2.0 Flash Thinking Experimental (Dec '24)
$0.000 t/s
Gemini 1.5 Pro (May '24)
$0.000 t/s
Gemini 1.5 Flash-8B
$0.000 t/s
Gemini 1.5 Flash (May '24)
$0.000 t/s
Gemma 3 27B Instruct
$0.0032 t/s
Gemma 3n E4B Instruct Preview (May '25)
$0.000 t/s
Gemini 1.0 Ultra
$0.000 t/s
Gemma 3 12B Instruct
$0.0032 t/s
PALM-2
$0.000 t/s
Gemini 1.0 Pro
$0.000 t/s
Gemma 3 270M
$0.000 t/s
Gemma 3n E4B Instruct
$0.0330 t/s
Gemma 3 4B Instruct
$0.0033 t/s
Gemma 3 1B Instruct
$0.0049 t/s
Gemma 3n E2B Instruct
$0.0047 t/s

DeepSeek

25 models

Meta

16 models

Mistral

31 models

xAI

14 models

Alibaba / Qwen

72 models
Qwen3.5 397B A17B (Reasoning)
$1.3557 t/s
Qwen3.5 27B (Reasoning)
$0.8291 t/s
Qwen3.5 122B A10B (Reasoning)
$1.10138 t/s
Qwen3.5 397B A17B (Non-reasoning)
$1.3552 t/s
Qwen3 Max Thinking
$2.4036 t/s
Qwen3.5 Omni Plus
$1.5050 t/s
Qwen3.5 27B (Non-reasoning)
$0.8287 t/s
Qwen3.5 35B A3B (Reasoning)
$0.69128 t/s
Qwen3.5 122B A10B (Non-reasoning)
$1.10157 t/s
Qwen3 Max Thinking (Preview)
$2.4043 t/s
Qwen3.5 9B (Reasoning)
$0.10172 t/s
Qwen3 Max
$2.4033 t/s
Qwen3.5 35B A3B (Non-reasoning)
$0.69142 t/s
Qwen3 235B A22B 2507 (Reasoning)
$2.6340 t/s
Qwen3 Coder Next
$0.60160 t/s
Qwen3 VL 235B A22B (Reasoning)
$2.6355 t/s
Qwen3.5 9B (Non-reasoning)
$0.08178 t/s
Qwen3.5 4B (Reasoning)
$0.06227 t/s
Qwen3 Next 80B A3B (Reasoning)
$1.88168 t/s
Qwen3 Max (Preview)
$2.4041 t/s
Qwen3 235B A22B 2507 Instruct
$1.2365 t/s
Qwen3 Coder 480B A35B Instruct
$3.0066 t/s
Qwen3 VL 32B (Reasoning)
$2.6396 t/s
Qwen3.5 4B (Non-reasoning)
$0.06220 t/s
Qwen3 30B A3B 2507 (Reasoning)
$0.75142 t/s
Qwen3 VL 235B A22B Instruct
$1.2358 t/s
Qwen3 Next 80B A3B Instruct
$0.88172 t/s
Qwen3 Coder 30B A3B Instruct
$0.9028 t/s
Qwen3 235B A22B (Reasoning)
$2.6351 t/s
QwQ 32B
$0.7433 t/s
Qwen3 VL 30B A3B (Reasoning)
$0.75129 t/s
Qwen3 4B 2507 (Reasoning)
$0.000 t/s
Qwen3 VL 32B Instruct
$1.2380 t/s
Qwen3 235B A22B (Non-reasoning)
$1.2351 t/s
Qwen3 VL 8B (Reasoning)
$0.66138 t/s
Qwen3 32B (Reasoning)
$2.63109 t/s
Qwen3.5 2B (Reasoning)
$0.04361 t/s
Qwen2.5 Max
$2.8047 t/s
Qwen3 14B (Reasoning)
$1.3164 t/s
Qwen3 VL 30B A3B Instruct
$0.35128 t/s
Qwen3 Omni 30B A3B (Reasoning)
$0.43104 t/s
Qwen2.5 Instruct 72B
$0.0056 t/s
Qwen3 30B A3B (Reasoning)
$0.7567 t/s
QwQ 32B-Preview
$0.1446 t/s
Qwen3 30B A3B 2507 Instruct
$0.3563 t/s
Qwen3.5 2B (Non-reasoning)
$0.04272 t/s
Qwen3 32B (Non-reasoning)
$1.23108 t/s
Qwen3 VL 8B Instruct
$0.31143 t/s
Qwen3 4B (Reasoning)
$0.40102 t/s
Qwen3 VL 4B (Reasoning)
$0.000 t/s
Qwen2.5 Instruct 32B
$0.000 t/s
Qwen3 8B (Reasoning)
$0.6684 t/s
Qwen2.5 Coder Instruct 32B
$0.000 t/s
Qwen3 4B 2507 Instruct
$0.000 t/s
Qwen3 14B (Non-reasoning)
$0.6166 t/s
Qwen3 4B (Non-reasoning)
$0.19105 t/s
Qwen3 30B A3B (Non-reasoning)
$0.3570 t/s
Qwen2.5 Turbo
$0.0972 t/s
Qwen2 Instruct 72B
$0.000 t/s
Qwen3 Omni 30B A3B Instruct
$0.43108 t/s
Qwen3 8B (Non-reasoning)
$0.3188 t/s
Qwen3.5 0.8B (Reasoning)
$0.02450 t/s
Qwen2.5 Coder Instruct 7B
$0.000 t/s
Qwen3.5 0.8B (Non-reasoning)
$0.02309 t/s
Qwen3 VL 4B Instruct
$0.000 t/s
Qwen1.5 Chat 110B
$0.000 t/s
Qwen Chat 72B
$0.000 t/s
Qwen3 1.7B (Reasoning)
$0.40140 t/s
Qwen Chat 14B
$0.000 t/s
Qwen3 1.7B (Non-reasoning)
$0.19141 t/s
Qwen3 0.6B (Reasoning)
$0.40196 t/s
Qwen3 0.6B (Non-reasoning)
$0.19187 t/s

NVIDIA

16 models

Amazon

13 models

Microsoft

4 models

Cohere

4 models

Moonshot

6 models

ai2

10 models

ai21-labs

7 models

baidu

2 models

bytedance_seed

2 models

databricks

1 models

deepcogito

1 models

ibm

7 models

inception

1 models

inclusionai

5 models

korea-telecom

2 models

kwaikat

2 models

lg

6 models

liquidai

8 models

longcat

1 models

mbzuai

4 models

minimax

6 models

motif-technologies

1 models

nanbeige

1 models

nous-research

7 models

openchat

1 models

perplexity

5 models

prime-intellect

1 models

reka-ai

2 models

sarvam

3 models

servicenow

2 models

snowflake

1 models

stepfun

2 models

swiss-ai-initiative

2 models

tii-uae

1 models

trillionlabs

2 models

upstage

6 models

xiaomi

5 models

zai

16 models