Live data from Artificial Analysis API

AI Model Benchmarks

Compare 511+ AI models across 12 benchmarks — Intelligence, Coding, Math, Science, and more. Data updated hourly.

Benchmarks:
511 models · click headers to sort
#
Model
Speed
$/1M
AA Index
GPQA
MMLU-Pro
LiveCode
AIME
HLE
1
60 t/s
$11.3
60.2
93.5%
44.3%
2
63 t/s
$11.3
58.9
93.2%
43.0%
3
49 t/s
$10.9
57.3
91.4%
39.6%
4
135 t/s
$4.5
57.2
94.1%
44.7%
5
79 t/s
$5.6
56.8
92.0%
41.6%
6
59 t/s
$11.3
56.7
92.6%
40.6%
7
227 t/s
$3.4
55.3
92.2%
41.0%
8
95 t/s
$1.7
53.9
91.1%
35.9%
9
54 t/s
$1.5
53.8
86.6%
33.8%
10
73 t/s
$4.8
53.6
91.5%
39.9%
11
94 t/s
$1.6
53.2
90.1%
35.0%
12
45 t/s
$10.9
52.9
89.6%
36.7%
13
$0.00
52.2
88.4%
39.9%
14
42 t/s
$10.9
51.8
88.5%
31.2%
15
37 t/s
$2.9
51.8
88.8%
28.9%
16
67 t/s
$6.6
51.7
87.5%
30.0%
17
29 t/s
$2.2
51.5
88.8%
35.9%
18
53 t/s
$2.1
51.4
86.8%
28.0%
19
67 t/s
$4.8
51.3
90.3%
87.4%
88.9%
99.0%
35.4%
20
58 t/s
$11.3
50.8
91.0%
31.0%
21
53 t/s
$1.1
50.0
88.2%
25.7%
22
30 t/s
$2.2
49.8
90.5%
33.5%
23
68 t/s
$1.6
49.8
82.0%
27.2%
24
54 t/s
$10.9
49.7
86.6%
89.5%
87.1%
91.3%
28.4%
25
44 t/s
$0.53
49.6
87.4%
28.1%
26
84 t/s
$3.0
49.3
91.1%
32.2%
27
70 t/s
$1.5
49.2
87.0%
28.3%
28
MiMo-V2.5
Xiaomi
89 t/s
$0.72
49.0
84.9%
25.2%
29
102 t/s
$4.8
49.0
89.9%
33.5%
30
157 t/s
$1.7
48.9
87.5%
26.6%
31
91 t/s
$3.0
48.5
88.5%
30.0%
32
141 t/s
$4.5
48.4
90.8%
89.8%
91.7%
95.7%
37.2%
33
63 t/s
$5.6
47.9
87.1%
28.9%
34
120 t/s
$3.4
47.7
87.3%
87.0%
86.8%
94.0%
26.5%
35
$0.00
46.8
84.7%
25.4%
36
36 t/s
$1.1
46.8
87.9%
29.4%
37
$4.8
46.6
86.4%
85.9%
89.4%
96.7%
24.9%
38
98 t/s
$0.17
46.5
89.4%
32.1%
39
43 t/s
$10.9
46.5
84.0%
18.6%
40
196 t/s
$1.1
46.4
89.8%
89.0%
90.8%
97.0%
34.7%
41
$0.17
46.0
86.7%
27.8%
42
63 t/s
$1.4
45.8
84.2%
21.6%
43
52 t/s
$1.4
45.0
89.3%
27.3%
44
110 t/s
$0.80
44.9
85.5%
20.4%
45
89 t/s
$3.4
44.6
85.4%
87.1%
84.6%
94.3%
26.5%
46
166 t/s
$3.4
44.6
83.7%
86.5%
84.0%
98.7%
25.6%
47
48 t/s
$6.6
44.4
79.9%
13.2%
48
155 t/s
$0.46
44.0
81.7%
26.5%
49
116 t/s
$0.53
43.8
85.5%
16.0%
50
48 t/s
$2.1
43.8
83.9%
25.6%
51
176 t/s
$0.56
43.5
84.1%
20.2%
52
112 t/s
$0.00
43.4
82.8%
19.9%
53
170 t/s
$3.4
43.1
86.0%
86.0%
84.9%
95.7%
23.4%
54
50 t/s
$10.9
43.1
81.0%
88.9%
73.8%
62.7%
12.9%
55
46 t/s
$6.6
43.0
83.4%
87.5%
71.4%
88.0%
17.3%
56
98 t/s
$1.7
42.9
78.8%
18.2%
57
$0.00
42.9
80.9%
15.8%
58
52 t/s
$6.6
42.6
79.7%
10.8%
59
85 t/s
$1.0
42.1
85.9%
85.6%
89.4%
95.0%
25.1%
60
90 t/s
$0.82
42.1
85.8%
22.2%
61
79 t/s
$3.4
42.0
84.2%
86.7%
70.3%
91.7%
23.5%
62
37 t/s
$32.8
42.0
80.9%
88.0%
65.4%
80.3%
11.9%
63
123 t/s
$0.20
41.9
86.7%
25.5%
64
87 t/s
$0.53
41.9
84.8%
19.1%
65
$0.34
41.7
84.0%
86.2%
86.2%
92.0%
22.2%
66
160 t/s
$1.1
41.6
85.7%
23.4%
67
150 t/s
$0.15
41.5
83.5%
20.0%
68
$11.0
41.5
87.7%
86.6%
81.9%
92.7%
23.9%
69
$4.5
41.3
88.7%
89.5%
85.7%
86.7%
27.6%
70
98 t/s
$0.69
41.2
82.8%
83.7%
83.8%
90.7%
19.7%
71
58 t/s
$11.3
40.9
76.8%
12.6%
72
113 t/s
$1.1
40.9
83.8%
84.8%
85.3%
94.7%
22.3%
73
o3-pro
OpenAI
22 t/s
$35.0
40.7
84.5%
74
51 t/s
$1.6
40.6
66.6%
7.2%
75
53 t/s
$1.4
40.1
86.1%
18.8%
76
45 t/s
$2.4
39.8
86.1%
26.2%
77
92 t/s
$0.53
39.4
83.0%
87.5%
81.0%
82.7%
22.2%
78
29 t/s
$2.2
39.3
71.7%
7.7%
79
36 t/s
$0.00
39.2
85.7%
22.7%
80
155 t/s
$3.0
39.2
74.8%
12.8%
81
64 t/s
$3.4
39.2
80.8%
86.0%
76.3%
83.0%
18.4%
82
148 t/s
$0.15
39.2
84.6%
84.3%
86.8%
96.3%
21.1%
83
33 t/s
$32.8
39.0
79.6%
87.3%
63.6%
73.3%
11.7%
84
110 t/s
$0.69
38.9
80.3%
82.8%
69.2%
85.0%
14.6%
85
49 t/s
$6.6
38.7
77.7%
84.2%
65.5%
74.3%
9.6%
86
$0.00
38.6
85.3%
85.4%
82.2%
89.3%
17.6%
87
56 t/s
$1.5
38.6
82.6%
13.9%
88
177 t/s
$0.69
38.6
81.3%
82.0%
83.6%
91.7%
16.9%
89
187 t/s
$0.00
38.5
82.6%
22.6%
90
Ring-2.6-1T
InclusionAI
$0.00
38.5
85.7%
18.3%
91
o3
OpenAI
86 t/s
$3.5
38.4
82.7%
85.3%
80.8%
88.3%
20.0%
92
150 t/s
$0.46
38.1
76.1%
14.7%
93
193 t/s
$0.15
37.8
83.1%
19.1%
94
150 t/s
$1.7
37.7
82.3%
17.1%
95
34 t/s
$1.2
37.3
78.9%
12.3%
96
90 t/s
$0.83
37.2
84.2%
13.2%
97
110 t/s
$2.2
37.1
67.2%
76.0%
61.5%
83.7%
9.7%
98
59 t/s
$1.4
37.1
82.9%
13.6%
99
46 t/s
$6.6
37.1
72.7%
86.0%
59.0%
37.0%
7.1%
100
131 t/s
$0.69
37.1
84.5%
19.7%
Showing top 100 of 511 models. Use search/filter to narrow down.

Benchmark Guide

Intelligence
Source ↗

Composite score across math, science, coding

Graduate-level science Q&A (Diamond)

MMLU-Pro
Source ↗

Knowledge & reasoning across 57 subjects

LiveCodeBench
Source ↗

Live coding benchmark with new problems

AIME 2025
Source ↗

American Invitational Math Exam

MATH-500
Source ↗

Competition-level math problems

Humanity's Last Exam - hardest questions

Composite coding benchmark score

Composite math benchmark score

SciCode
Source ↗

Scientific coding problems

IFBench
Source ↗

Instruction following benchmark

TerminalBench
Source ↗

Terminal/CLI task completion

Compare pricing for all models side by side

Open AI API Cost Calculator →