UABUnbiased AI BenchGlass box for model evals.
Every leaderboard, with receipts.
Home/Heatmap
Heatmap
Live · updated continuously
Home · heatmap explorer

The matrix,
scrubbable.

A denser mode for scanning broad coverage quickly. The values stay grounded in exact comparable groups even when the table becomes compact.
Surface · scrubbable matrix
Sources · 9
Theme · light / dark
Build / data stamp

Read this before trusting a headline.

Data snapshot May 1, 2026Registry verification passed9 providers · 826 tracked modelsPage refreshed May 7, 2026

If this stamp lags behind the repo, you are likely looking at an older build or cached deploy.

Sort models by
Cell shows
Modality
Coverage floor
795 models · 40 benchmarks
AR · ratingAR · ratingAR · ratingAR · ratingAR · ratingAR · ratingAR · ratingAR · ratingAR · ratingAR · ratingAR · ratingLB · %LB · %LB · %LB · %LB · %LB · %AA · indexAA · sAA · %AA · ratingAA · ratingAA · ratingAA · ratingBB · %BB · %BB · %BB · t/sBB · msTERMINAL-BENCH · %SL · %SL · %SL · %SL · %SL · %SL · %SL · %SL · %OC · %MTEB · ndcg
Claude Opus 4.6
Anthropic
100%
98.3%
98%
98.3%
100%
100%
94.6%
6.3%
100%
33.3%
100%
20%
40%
82.6%
33.3%
42.9%
30%
81.8%
100%
60%
100%
70%
42.9%
40.8%
GPT-5.4
OpenAI
95.9%
73.3%
70.3%
73.3%
63%
77.8%
77.4%
0%
20%
30%
66.7%
75%
10%
90%
60.9%
60%
71.4%
100%
100%
50%
20%
84.6%
90%
71.4%
38.3%
Gemini 2.5 Pro
Google
97.2%
8.3%
88.1%
8.3%
33.3%
27.8%
100%
96.8%
96.8%
80.6%
100%
100%
67%
7.5%
39.1%
86.7%
100%
90%
27.3%
33.3%
38.5%
10%
100%
35.9%
Claude Opus 4.7
Anthropic
99.1%
100%
100%
100%
92.6%
94.4%
98%
8.8%
70%
0%
12.5%
40%
80%
43.5%
33.3%
42.9%
80%
42.9%
28.4%
GPT-5.4 mini
OpenAI
68%
61.7%
58.4%
61.7%
14.8%
48.8%
0.8%
20%
20%
55.6%
25%
80%
100%
34.8%
60%
71.4%
60%
45.5%
66.7%
46.2%
40%
71.4%
27.8%
GPT-5.5
OpenAI
97.5%
81.7%
95%
81.7%
96.3%
83.3%
87.2%
1.3%
0%
88.9%
50%
60%
70%
100%
100%
27.3%
Claude Sonnet 4.6
Anthropic
94.3%
95%
94.1%
95%
88.9%
88.9%
90.9%
1.7%
90%
88.9%
75%
30%
50%
13.3%
21.4%
0%
21.4%
26%
GPT-5.4 nano
OpenAI
53.5%
56.7%
45.5%
56.7%
14.8%
27.6%
2.5%
20%
10%
22.2%
70%
60%
4.3%
60%
71.4%
60%
45.5%
66.7%
46.2%
40%
71.4%
22.6%
GPT-5
OpenAI
70.9%
56.7%
70.3%
56.7%
14.8%
50.5%
3.3%
20%
60.9%
60%
71.4%
60%
45.5%
66.7%
46.2%
40%
71.4%
21.6%
Grok 4.3
xAI
86.7%
65%
90.1%
65%
74.1%
99%
12.6%
60%
44.4%
62.5%
100%
20%
19.5%
o1
OpenAI
65.5%
66.3%
77.4%
93.5%
93.5%
90.3%
77.4%
80.6%
69.4%
4.6%
18%
Grok 4.20
xAI
94.9%
60%
87.1%
60%
70.4%
22.2%
66%
7.9%
80%
11.1%
37.5%
90%
30%
0%
17.9%
Claude Sonnet 3.7
Anthropic
49.7%
57.4%
93.5%
90.3%
87.1%
87.1%
83.9%
93.5%
69.4%
17.8%
DeepSeek Reasoner
DeepSeek
68%
87.1%
71%
77.4%
71%
96.8%
64.5%
44.1%
93.3%
0%
30.8%
0%
17.6%
GPT-4.5 Preview
OpenAI
80.1%
79.2%
96.8%
87.1%
90.3%
45.2%
87.1%
96.8%
16.6%
GPT-5.1
OpenAI
84.2%
18.3%
83.2%
18.3%
63%
5.6%
61.6%
3.8%
0%
78.3%
50%
91.7%
69.2%
30%
16.4%
Claude Haiku 4.5
Anthropic
71.5%
31.7%
37.6%
31.7%
16.7%
54.8%
32.3%
29%
19.4%
48.4%
64.5%
69.4%
93.3%
30.4%
15.8%
Claude Sonnet 4.5
Anthropic
88.6%
50%
63.4%
50%
44.4%
55.6%
81.5%
74.1%
56.5%
13.3%
21.4%
0%
21.4%
15.5%
Grok 3
xAI
86.4%
61.3%
83.9%
80.6%
61.3%
61.3%
58.1%
56.9%
60.7%
15.3%
o3 mini
OpenAI
52.8%
90.3%
61.3%
64.5%
67.7%
93.5%
83.9%
56.9%
5.4%
14.4%
Claude Opus 4.5
Anthropic
93%
86.7%
86.7%
59.3%
72.2%
90.9%
10%
73.9%
14.3%
o1 Preview
OpenAI
60.8%
51.6%
100%
100%
100%
67.7%
35.5%
54.9%
14.3%
GPT-4o
OpenAI
44.3%
32.7%
0%
64.5%
74.2%
71%
54.8%
54.8%
74.2%
27.6%
71.5%
14.2%
Kimi K2.6
Unknown
96.5%
93.3%
93.1%
93.3%
61.1%
99.3%
28%
14.1%
GPT-5.2
OpenAI
79.1%
36.7%
82.2%
36.7%
55.6%
0%
75.4%
0.4%
100%
82.6%
13.7%
Claude Sonnet 3.5
Anthropic
43.4%
43.6%
83.9%
77.4%
83.9%
38.7%
71%
90.3%
13.3%
Claude Opus 4
Anthropic
64.6%
46.7%
62.4%
46.7%
7.4%
73.7%
64.4%
43.5%
33.3%
42.9%
42.9%
13.2%
Claude Opus 4.1
Anthropic
80.1%
46.7%
46.7%
40.7%
78.8%
64.9%
43.5%
33.3%
42.9%
42.9%
13%
o3
OpenAI
76.9%
75.2%
37%
83.2%
13.4%
100%
36.4%
58.3%
20%
12.5%
Dreamina Seedance 2.0 720p
Unknown
100%
100%
100%
98.6%
98.4%
12.4%
Gemini 3 Flash
Google
94.3%
53.3%
91.1%
53.3%
77.8%
11.1%
77.4%
17.6%
11.9%
HappyHorse-1.0
Unknown
97.4%
97.4%
80%
100%
100%
11.9%
GLM-5.1
Zhipu
97.8%
96.7%
96.7%
97.6%
69%
11.4%
muse-spark
Unknown
98.7%
91.7%
99%
91.7%
61.1%
11.1%
GLM-4.7
Zhipu
90.2%
78.3%
78.3%
89.2%
74.5%
10.3%
Qwen3.5 397B A17B
Qwen
92.4%
51.7%
86.1%
51.7%
86.9%
39.7%
10.2%
DeepSeek Chat
DeepSeek
85.1%
35%
35%
45.2%
19.4%
12.9%
16.1%
29%
54.8%
71.4%
10.1%
MiMo-V2.5-Pro
Unknown
96.8%
90%
90%
99.3%
25.1%
10%
GLM-5
Zhipu
92.7%
73.3%
73.3%
96.3%
57.7%
9.8%
GPT-4 Turbo
OpenAI
39.2%
31.7%
41.9%
48.4%
51.6%
25.8%
45.2%
48.4%
27.6%
32.6%
9.8%
Qwen3.6 Plus
Qwen
91.1%
86.7%
86.7%
96.3%
30.5%
9.8%
Gemini 3.1 Pro
Google
85.2%
20%
90.9%
41.7%
40%
0%
100%
9.4%
GPT Image 2 (high)
OpenAI
100%
100%
100%
75%
9.4%
Claude Haiku 3.5
Anthropic
38%
37.6%
54.8%
32.3%
29%
19.4%
48.4%
64.5%
44.1%
9.2%
kimi-k2.5-thinking
Unknown
93%
70%
89.1%
70%
44.4%
9.2%
Qwen3.5 122B A10B
Qwen
82%
41.7%
78.2%
41.7%
78.8%
37.2%
9%
MiniMax-M2.7
Unknown
72.2%
68.3%
68.3%
96.3%
54.4%
9%
o1 mini
OpenAI
52.2%
48.4%
58.1%
54.8%
48.4%
80.6%
12.9%
8.9%
MiMo-V2-Pro
Unknown
89.9%
70%
70%
96%
21.3%
8.7%
Qwen3.5 27B
Qwen
75.9%
38.3%
79.2%
38.3%
81.5%
18.4%
8.3%
Claude Sonnet 4
Anthropic
58.2%
63.4%
73.7%
77.4%
13.3%
21.4%
0%
21.4%
8.2%
Grok 4
xAI
77.5%
58.4%
29.6%
89.2%
9.6%
0%
10%
26.1%
0%
14.3%
7.1%
8%
GPT-OSS 120B
OpenAI
65.2%
54.9%
87.9%
13%
8.3%
92.3%
8%
GPT-5.3 Codex
OpenAI
63.3%
63.3%
93.6%
2.9%
95.7%
8%
Claude Opus 3
Anthropic
39.2%
22.8%
19.4%
67.7%
74.2%
12.9%
16.1%
22.6%
42.1%
7.9%
mimo-v2.5
Unknown
79.4%
80%
76.2%
80%
7.9%
GLM-4.6
Zhipu
90.8%
40%
40%
67%
71.5%
7.7%
MiniMax-M2.5
Unknown
66.8%
45%
45%
89.2%
58.2%
7.6%
kimi-k2.5-instant
Unknown
83.5%
65%
84.2%
65%
7.4%
PixVerse V5.6
Unknown
57.9%
60.5%
87.3%
85.7%
7.3%
Grok 4.1 Fast
xAI
76.9%
13.3%
60.4%
13.3%
51.9%
54.9%
18%
7.2%
GPT-4o mini
OpenAI
46.5%
29.7%
32.3%
22.6%
16.1%
29%
29%
35.5%
22.2%
23.4%
7.2%
Kling 2.5 Turbo 1080p
Unknown
52.6%
55.3%
81.7%
90.5%
7%
GPT-4.1
OpenAI
70.3%
73.3%
59.3%
76.6%
7%
Grok 3 mini
xAI
63.6%
71%
9.7%
22.6%
3.2%
87.1%
16.1%
6.8%
Gemma 4 31B
Google
91.5%
33.3%
84.2%
61.5%
6.8%
GPT-OSS 20B
OpenAI
46.8%
48.8%
94.6%
0%
76.9%
6.7%
GPT-4.1 mini
OpenAI
58.9%
68.3%
52.2%
85.8%
6.6%
Trinity Large Thinking
Unknown
63.6%
20%
20%
71.4%
89.5%
6.6%
deepseek-v4-pro-thinking
DeepSeek
94%
85%
85%
6.6%
Grok 2
xAI
50.6%
29%
45.2%
41.9%
35.5%
29%
32.3%
6.6%
DeepSeek V4 Flash (Max)
DeepSeek
86.7%
95.3%
79.9%
6.5%
Vidu Q3 Pro
Unknown
78.9%
90.1%
92.1%
6.5%
MiniMax-M2
Unknown
59.5%
30%
30%
78.8%
62.3%
6.5%
GPT-4
OpenAI
27.2%
12.9%
51.6%
67.7%
41.9%
12.9%
22.6%
22.2%
6.5%
Kling 2.6 Pro (January)
Unknown
50%
57.9%
73.2%
71.4%
6.3%
Grok Imagine Video
xAI
60%
94.4%
96.8%
6.3%
Veo 3
Unknown
63.2%
44.7%
85.9%
54%
6.2%
DeepSeek V4 Pro (Max)
DeepSeek
93%
98%
55.2%
6.2%
Mistral Large 3
Mistral
88.6%
11.7%
11.7%
52.2%
81.6%
6.1%
Gemini 2.5 Flash
Google
76.6%
72.3%
48.8%
9.2%
8.7%
25%
6%
Qwen3 VL 235B A22B
Qwen
83.5%
74.3%
48.8%
33.9%
6%
Grok 2 mini
xAI
43.4%
16.1%
29%
35.5%
74.2%
25.8%
16.1%
6%
o4 mini
OpenAI
60.8%
67.3%
73.7%
6.7%
30%
6%
GPT Image 1.5
OpenAI
53.6%
9.3%
75%
100%
5.9%
Grok Beta
xAI
35.5%
38.7%
48.4%
32.3%
29%
51.6%
5.9%
Step 3.5 Flash 2603
StepFun
75.3%
83.2%
77%
5.9%
Qwen3.5 35B A3B
Qwen
73.7%
21.7%
21.7%
69.4%
46%
5.8%
MiMo-V2-Flash
Unknown
73.4%
28.3%
28.3%
67%
33.5%
5.8%
NVIDIA Nemotron 3 Super
Unknown
69.6%
78.8%
69.5%
5.4%
Runway Gen-4.5
Unknown
55.3%
84.5%
77.8%
5.4%
Qwen3-Coder 480B A35B
Qwen
62%
25%
25%
56.9%
28.5%
17.4%
5.4%
MiMo-V2-Omni
Unknown
69.3%
90.9%
51.9%
5.3%
Hailuo 2.3
Unknown
36.8%
44.7%
63.4%
58.7%
5.1%
GLM-4.7-Flash
Zhipu
60.8%
67%
75.3%
5.1%
Ray 3
Unknown
44.7%
28.9%
76.1%
52.4%
5.1%
DeepSeek V3.2 Exp
DeepSeek
84.2%
26.7%
26.7%
63.3%
5%
Grok 4 Fast
xAI
74.1%
3.3%
3.3%
48.1%
52.2%
15.5%
4.9%
KAT-Coder-Pro V1
Unknown
23.3%
23.3%
78.8%
67.4%
4.8%
GPT-4.1 nano
OpenAI
46.2%
27.7%
22.2%
95.4%
4.8%
Hailuo 02 Pro
Unknown
39.5%
31.6%
54.9%
65.1%
4.8%
SkyReels V4
Unknown
95.8%
92.1%
4.7%
dola-seed-2.0-pro
Unknown
96.2%
91.1%
4.7%
veo-3.1-audio-1080p
Unknown
94.7%
92.1%
4.7%
Kling 3.0 1080p (Pro)
Unknown
97.2%
88.9%
4.7%
PixVerse V6
Unknown
88.7%
95.2%
4.6%
gpt-image-1.5-high-fidelity
OpenAI
94.6%
88.4%
4.6%
Hailuo 02 Standard
Unknown
31.6%
23.7%
64.8%
61.9%
4.5%
minimax-m2.1-preview
Unknown
71.8%
55%
55%
4.5%
Kling 3.0 Omni 1080p (Pro)
Unknown
93%
87.3%
4.5%
GLM-4.6V
Zhipu
68.7%
54.5%
39.7%
13.8%
4.4%
grok-imagine-video-720p
xAI
81.6%
94.7%
4.4%
veo-3.1-audio
Unknown
86.8%
89.5%
4.4%
Qwen3 235B 2507
Qwen
81.3%
56.9%
36.4%
4.4%
veo-3.1-fast-audio-1080p
Unknown
92.1%
81.6%
4.3%
GLM-4.5
Zhipu
88.3%
59.3%
25.5%
4.3%
Grok Imagine Image
xAI
87.5%
83.7%
4.3%
veo-3.1-fast-audio
Unknown
84.2%
86.8%
4.3%
Qwen3 Next 80B A3B
Qwen
81.3%
47.1%
41%
4.2%
Grok 3 mini Reasoning (high)
xAI
71.4%
97.5%
4.2%
deepseek-v3.2-thinking
DeepSeek
82%
43.3%
43.3%
4.2%
Grok Imagine Image Pro
xAI
82.1%
86%
4.2%
Qwen3.5 9B
Qwen
71.4%
95.8%
4.2%
Mercury 2
Unknown
62.7%
5%
5%
73.7%
20.1%
4.2%
Kling 3.0 Omni 720p (Standard)
Unknown
91.5%
74.6%
4.2%
Grok 4.20 0309
xAI
67%
94.1%
4%
Kling 3.0 720p (Standard)
Unknown
83.1%
77.8%
4%
Gemma 4 26B A4B
Google
89.6%
69.4%
4%
Veo 3.1 Fast
Unknown
77.5%
81%
4%
DeepSeek V4 Flash
DeepSeek
78.8%
78.7%
3.9%
Qwen3 Max
Qwen
78.2%
59.3%
19.7%
3.9%
DeepSeek V4 Pro (High)
DeepSeek
96.3%
59.8%
3.9%
PixVerse V5.5
Unknown
74.6%
81%
3.9%
Sora 2 Pro
Unknown
86.8%
67.6%
3.9%
QwQ-32B
Unknown
54.7%
47.1%
52.3%
3.9%
Gemini 2.5 Flash-Lite
Google
66.8%
56.4%
22.2%
8.4%
3.8%
GLM-4.5-Air
Zhipu
70.3%
52.2%
31%
3.8%
Step 3.5 Flash
StepFun
83.2%
69.5%
3.8%
qwen-image-2.0-pro-2026-04-22
Qwen
85.7%
65.1%
3.8%
Qwen3.5 4B
Qwen
61.6%
88.7%
3.8%
Kling 2.6 Standard (January)
Unknown
69%
81%
3.7%
seedream-4.5
Unknown
69.6%
79.1%
3.7%
Veo 3.1
Unknown
80.3%
68.3%
3.7%
Mistral Medium 3.5
Mistral
84.2%
64%
3.7%
GLM-5.1
Zhipu
92.9%
55.2%
3.7%
Llama 4 Maverick
Meta
42.1%
82.4%
0%
0%
23.1%
0%
3.7%
Kimi K2 Thinking
Unknown
87.2%
60.3%
3.7%
veo-3-fast-audio
Unknown
78.9%
68.4%
3.7%
Devstral 2
Mistral
6.7%
6.7%
50.5%
83.3%
3.7%
GLM-4.7
Zhipu
75.4%
71.5%
3.7%
flux-2-max
Unknown
83.9%
62.8%
3.7%
Qwen3 Coder Next
Qwen
63.3%
82.8%
3.7%
DeepSeek V3.1 Terminus
DeepSeek
80.1%
66%
3.7%
kimi-k2-thinking-turbo
Unknown
78.8%
33.3%
33.3%
3.6%
DeepSeek V3.1
DeepSeek
82%
63.3%
3.6%
veo-3-audio
Unknown
73.7%
71.1%
3.6%
MiMo-V2-Omni-0327
Unknown
93.6%
50.6%
3.6%
P-Video
Unknown
42.1%
39.5%
36.6%
25.4%
3.6%
Ling-flash-2.0
Unknown
63.3%
36%
43.5%
3.6%
MiniMax-M2.1
Unknown
84.2%
58.2%
3.6%
GLM-4.6
Zhipu
73.7%
68.2%
3.5%
qwen3-vl-235b-a22b-thinking
Qwen
75.3%
65.3%
3.5%
mistral-medium-2508
Mistral
86.7%
52.5%
3.5%
KAT-Coder-Pro V2
Unknown
92.9%
45.2%
3.5%
Olmo 3.1 32B Instruct
Unknown
51.9%
18.5%
66.9%
3.4%
DeepSeek V4 Pro
DeepSeek
84.2%
53.1%
3.4%
Qwen3.5 4B
Qwen
52.2%
84.5%
3.4%
seedream-4-2k
Unknown
67.9%
67.4%
3.4%
PixVerse V5
Unknown
60.6%
74.6%
3.4%
Magistral Small 1.2
Mistral
42.1%
92.9%
3.4%
Qwen3.6 35B A3B
Qwen
90.9%
43.5%
3.4%
GLM-5
Zhipu
87.2%
46.9%
3.4%
flux-2-pro
Unknown
80.4%
53.5%
3.3%
NVIDIA Nemotron 3 Nano
Unknown
54.9%
77.8%
3.3%
Mistral Small 4
Mistral
44.1%
88.3%
3.3%
GLM-4.5V
Zhipu
57%
48.5%
22.2%
4.2%
3.3%
Seedance 1.5 pro
Unknown
62%
69.8%
3.3%
Ring-flash-2.0
Unknown
55.4%
27.6%
48.5%
3.3%
Ministral 3 14B
Mistral
36%
95%
3.3%
Nova 2.0 Pro Preview
Unknown
52.2%
78.7%
3.3%
MiMo-V2-Flash
Unknown
84.2%
45.6%
3.2%
hunyuan-vision-1.5-thinking
Tencent
76.3%
53.5%
3.2%
seedance-v1.5-pro
Unknown
65.8%
63.2%
3.2%
Ministral 3 8B
Mistral
31.3%
97.5%
3.2%
Sora 2 (December)
Unknown
71.1%
57.7%
3.2%
Devstral Small
Mistral
31.3%
96.7%
3.2%
NVIDIA Nemotron Nano 9B V2
Unknown
31.3%
96.7%
3.2%
Mistral Small 3.2
Mistral
31.3%
95.8%
3.2%
Qwen3 32B
Qwen
58.9%
31.3%
36.8%
3.2%
seedream-5.0-lite
Unknown
57.1%
69.8%
3.2%
wan2.7-image-pro
Unknown
46.4%
79.1%
3.1%
Magistral Medium 1.2
Mistral
61.6%
63.6%
3.1%
Ling 2.6 Flash
Unknown
59.3%
65.3%
3.1%
Kling O1 Pro (January)
Unknown
44.7%
0%
78.9%
3.1%
Wan 2.5 Preview
Unknown
56.3%
66.7%
3.1%
Kimi K2.5
Unknown
95.3%
27.2%
3.1%
Kimi K2
Unknown
59.3%
63.2%
3.1%
NVIDIA Nemotron 3 Nano
Unknown
22.2%
100%
3.1%
Qwen3.5 Omni Plus
Qwen
84.2%
37.2%
3%
wan2.7-image
Unknown
44.6%
76.7%
3%
Wan 2.6
Unknown
70.4%
50.8%
3%
MiMo-V2-Flash (Feb 2026)
Unknown
87.2%
33.9%
3%
Kimi K2.6
Unknown
90.9%
30.1%
3%
Nova 2.0 Lite
Unknown
42.1%
78.7%
3%
Qwen3.6 Max Preview
Qwen
98%
22.6%
3%
Llama Nemotron Super 49B v1.5
Meta
44.1%
75.7%
3%
DeepSeek V3 0324
DeepSeek
68.7%
50.5%
3%
Qwen3.5 2B
Qwen
31.3%
87%
3%
Pixtral Large
Unknown
28.7%
27.6%
61.9%
3%
flux-2-flex
Unknown
78.6%
39.5%
3%
Llama 4 Scout
Meta
27.6%
90.4%
2.9%
Mistral Medium 3
Mistral
44.1%
73.6%
2.9%
Devstral Small 2
Mistral
44.1%
73.2%
2.9%
Devstral Medium
Mistral
44.1%
72.8%
2.9%
GLM-4.7-Flash
Zhipu
50.5%
66.1%
2.9%
mistral-medium-2505
Mistral
67.4%
48.5%
2.9%
Qwen3.6 27B
Qwen
94.6%
20.9%
2.9%
qwen-image-2.0-2026-03-03
Qwen
57.1%
58.1%
2.9%
Ministral 3 3B
Mistral
15.5%
99.6%
2.9%
Seedance 1.0
Unknown
53.5%
60.3%
2.8%
Claude Sonnet 3
Anthropic
31.6%
12.9%
3.2%
12.9%
25.8%
9.7%
6.5%
0%
11.1%
2.8%
gemma-3-27b-it
Google
63%
50.5%
2.8%
Hy3-preview
Unknown
89.2%
23.8%
2.8%
flux-2-dev
Unknown
73.2%
39.5%
2.8%
Qwen3.5 Omni Flash
Qwen
59.3%
53.1%
2.8%
Granite 4.1 8B
Unknown
18.5%
93.7%
2.8%
Mistral Small 3.1
Mistral
27.6%
84.5%
2.8%
Mistral Small 3
Mistral
22.2%
89.5%
2.8%
NVIDIA Nemotron Nano 12B v2 VL
Unknown
31.3%
80.3%
2.8%
Qwen3.6 35B A3B
Qwen
71.4%
40.2%
2.8%
Qwen3 Next 80B A3B
Qwen
61.6%
49%
2.8%
veo-3-fast
Unknown
60.5%
50%
2.8%
Grok 4.1
xAI
90.5%
10%
10%
2.8%
Kimi K2.5
Unknown
81.5%
28.9%
2.8%
Qwen3.5 0.8B
Qwen
11.1%
99.2%
2.8%
LFM2 24B A2B
Unknown
11.1%
98.7%
2.7%
Nova 2.0 Omni
Unknown
39.7%
69.5%
2.7%
step-1o-turbo-202506
StepFun
57.6%
50.5%
2.7%
Grok Code Fast
xAI
1.7%
1.7%
66%
16.3%
21.7%
2.7%
Llama 3.1 8B
Meta
18.5%
88.7%
2.7%
Mistral 7B
Mistral
5.1%
3.7%
98.3%
2.7%
Mistral Medium 3.1
Mistral
48.8%
58.2%
2.7%
Hermes 4 70B
Unknown
36%
70.7%
2.7%
Nova Lite
Unknown
22.2%
84.1%
2.7%
Llama Nemotron Super 49B v1.5
Meta
31.3%
74.5%
2.6%
Claude 4.5 Sonnet
Anthropic
90.9%
14.6%
2.6%
MiMo-V2.5-Pro
Unknown
78.8%
26.4%
2.6%
Qwen3.5 Flash
Qwen
74.7%
15%
15%
2.6%
NVIDIA Nemotron Nano 9B V2
Unknown
22.2%
81.6%
2.6%
step-3
StepFun
60.1%
43.6%
2.6%
Qwen3.6 27B
Qwen
81.5%
20.5%
2.5%
Phi-4 Multimodal
Unknown
11.1%
90.4%
2.5%
Kling 2.1 Master
Unknown
52.1%
49.2%
2.5%
mistral-small-2506
Mistral
58.5%
42.6%
2.5%
Claude 4.1 Opus
Anthropic
89.2%
11.7%
2.5%
Llama 3.2 11B (Vision)
Meta
9.1%
91.6%
2.5%
Veo 2
Unknown
21.1%
10.5%
47.9%
20.6%
2.5%
BAAI bge-large-en-v1.5
BAAI
100%
2.5%
Qwen2.5 72B
Qwen
41.1%
36%
22.2%
2.5%
Vidu Q2 Turbo
Unknown
42.1%
57.1%
2.5%
Qwen3 235B A22B 2507
Qwen
67%
31.8%
2.5%
ernie-5.1-preview
Unknown
98.4%
2.5%
Claude 4 Opus
Anthropic
84.2%
14.2%
2.5%
qwen3.5-max-preview
Qwen
98.1%
2.5%
Phi-4 Mini
Unknown
5.7%
92.1%
2.4%
chatgpt-image-latest-high-fidelity (20251216)
Unknown
97.7%
2.4%
Nova Micro
Unknown
11.1%
86.2%
2.4%
Hy3-preview
Unknown
75.4%
21.8%
2.4%
Kimi K2 0905
Unknown
69.4%
27.2%
2.4%
Mistral Small (Sep)
Mistral
11.1%
85.4%
2.4%
Llama 3.2 90B (Vision)
Meta
18.5%
77.8%
2.4%
Claude 4 Sonnet
Anthropic
84.2%
12.1%
2.4%
Mistral Medium
Mistral
24.4%
9.1%
62.3%
2.4%
ernie-5.0-0110
Unknown
95.6%
2.4%
LTX-2 Pro Open Weights
Unknown
50.7%
44.4%
2.4%
amazon-nova-experimental-chat-26-02-10
Amazon
94.9%
2.4%
Qwen3 VL 30B A3B
Qwen
47.1%
47.7%
2.4%
Llama 3.3 70B
Meta
27.6%
66.5%
2.4%
DeepSeek V4 Flash (High)
DeepSeek
93.6%
2.3%
Qwen3 235B A22B
Qwen
65.5%
6.7%
7.1%
14.3%
2.3%
Hermes 4 70B
Unknown
22.2%
70.7%
2.3%
DeepSeek R1 Distill Llama 70B
DeepSeek
36%
56.9%
2.3%
Claude 4.5 Haiku
Anthropic
81.5%
11.3%
2.3%
Mistral Small (Feb)
Mistral
9.1%
83.7%
2.3%
Jamba 1.6 Mini
Unknown
5.7%
87%
2.3%
Gemma 3 1B
Google
0.7%
92.1%
2.3%
llama-4-maverick-17b-128e-instruct
Meta
47.2%
45.5%
2.3%
DeepSeek V3 (Dec)
DeepSeek
56.6%
36%
2.3%
Mistral Large 2 (Nov)
Mistral
31.3%
61.1%
2.3%
Llama 3.2 3B
Meta
11.1%
81.2%
2.3%
Phi-4
Unknown
31.3%
11.1%
49.8%
2.3%
Qwen3 VL 32B
Qwen
56.9%
34.7%
2.3%
ernie-5.0-preview-1203
Unknown
91.5%
2.3%
qwen3.6-max-preview
Qwen
91.5%
2.3%
NVIDIA Nemotron Nano 12B v2 VL
Unknown
11.1%
80.3%
2.3%
Cogito v2.1
Unknown
91.2%
2.3%
mai-image-2
Unknown
91.1%
2.3%
Qwen3 Max Thinking
Qwen
71.4%
19.2%
2.3%
LTX-2 Fast Open Weights
Unknown
49.3%
41.3%
2.3%
Qwen3 VL 235B A22B
Qwen
63.3%
26.8%
2.3%
Nova 2.0 Pro Preview (medium)
Unknown
78.8%
10.9%
2.2%
o3-pro
OpenAI
87.2%
2.1%
2.2%
reve-v1.5
Unknown
89.3%
2.2%
Qwen3 Omni 30B A3B
Qwen
36%
52.7%
2.2%
ernie-5.0-preview-1022
Unknown
88.6%
2.2%
Qwen3 30B A3B 2507
Qwen
50.5%
38.1%
2.2%
Nova 2.0 Pro Preview (low)
Unknown
71.4%
17.2%
2.2%
Pika 2.5
Unknown
40.8%
47.6%
2.2%
deepseek-r1-0528
DeepSeek
88%
2.2%
seedream-4-fal
Unknown
55.4%
32.6%
2.2%
Nova 2.0 Lite (high)
Unknown
77.4%
10.5%
2.2%
hunyuan-large-vision
Tencent
39.9%
47.5%
2.2%
Llama 3.2 1B
Meta
0.7%
86.2%
2.2%
seedance-v1-pro
Unknown
34.2%
52.6%
2.2%
longcat-flash-chat-2602-exp
Unknown
86.7%
2.2%
seedream-4-high-res-fal
Unknown
51.8%
34.9%
2.2%
Sarvam 105B (high)
Unknown
42.1%
44.4%
2.2%
deepseek-v4-flash-thinking
DeepSeek
86.1%
2.2%
deepseek-v3.2-exp-thinking
DeepSeek
85.1%
2.1%
Hermes 4 405B
Unknown
44.1%
40.6%
2.1%
grok-imagine-video-480p
xAI
84.2%
2.1%
longcat-flash-chat
Unknown
84.2%
2.1%
Qwen3 VL 30B A3B
Qwen
36%
48.1%
2.1%
Jamba 1.6 Large
Unknown
15.5%
68.2%
2.1%
amazon-nova-experimental-chat-12-10
Amazon
83.2%
2.1%
Hermes 4 405B
Unknown
42.1%
41%
2.1%
llama-4-scout-17b-16e-instruct
Meta
43.4%
39.6%
2.1%
Qwen3 30B A3B 2507
Qwen
31.3%
51.5%
2.1%
mistral-small-3.1-24b-instruct-2503
Mistral
43%
39.6%
2.1%
deepseek-v3.1-terminus-thinking
DeepSeek
82%
2%
GLM-4.6V
Zhipu
52.2%
29.7%
2%
glm-5v-turbo
Zhipu
81.2%
2%
Jamba 1.7 Large
Unknown
15.5%
65.7%
2%
Seed-OSS-36B-Instruct
Unknown
56.9%
24.3%
2%
Llama 3.1 405B
Meta
39.7%
41%
2%
Qwen3 VL 8B
Qwen
39.7%
41%
2%
Kling 2.0
Unknown
42.3%
38.1%
2%
Qwen3 Coder 30B A3B
Qwen
47.1%
33.1%
2%
deepseek-v3.1-thinking
DeepSeek
80.1%
2%
Nova 2.0 Lite (medium)
Unknown
67%
13%
2%
LTX-2.3 Fast Open Weights
Unknown
46.5%
33.3%
2%
molmo-2-8b
Unknown
49.1%
30.7%
2%
amazon-nova-experimental-chat-11-10
Amazon
79.4%
2%
Vidu Q2 Pro
Unknown
23.7%
55.6%
2%
LTX-2.3 Pro Open Weights
Unknown
43.7%
34.9%
2%
qwen3-235b-a22b-thinking-2507
Qwen
78.2%
2%
Qwen3 235B
Qwen
47.1%
31%
2%
hunyuan-hy3-preview
Tencent
77.8%
1.9%
GLM-4.5V
Zhipu
31.3%
46.4%
1.9%
Claude 3.7 Sonnet
Anthropic
77.4%
1.9%
Gemma 3n E4B
Google
0.7%
75.7%
1.9%
kling-v3-pro
Unknown
76.3%
1.9%
wan2.6-t2v
Unknown
76.3%
1.9%
ernie-5.0-preview-1220
Unknown
76.2%
1.9%
Command A
Unknown
22.2%
54%
1.9%
Llama 3.1 70B
Meta
18.5%
57.3%
1.9%
DeepSeek V3.1 Terminus
DeepSeek
75.4%
1.9%
Ling-2.6-1T
Unknown
75.4%
1.9%
hunyuan-image-3.0
Tencent
75%
1.9%
Nano Banana 2 (Gemini 3.1 Flash Image Preview)
Unknown
50%
25%
1.9%
Nano Banana Pro (Gemini 3 Pro Image)
Unknown
25%
50%
1.9%
hunyuan-t1-20250711
Tencent
74.7%
1.9%
Qwen3 VL 32B
Qwen
39.7%
34.7%
1.9%
amazon-nova-experimental-chat-26-01-10
Amazon
74.4%
1.9%
wan2.5-i2v-preview
Unknown
73.7%
1.8%
Nova 2.0 Lite (low)
Unknown
56.9%
16.7%
1.8%
TeleVideo 2.0
Unknown
73%
1.8%
flux-2-klein-9b
Unknown
35.7%
37.2%
1.8%
amazon-nova-experimental-chat-10-20
Amazon
72.8%
1.8%
qwen3-235b-a22b-no-thinking
Qwen
72.8%
1.8%
mai-1-preview
Unknown
72.5%
1.8%
Qwen3 30B
Qwen
31.3%
41%
1.8%
hunyuan-image-3.0-instruct
Tencent
72.1%
1.8%
Kling O1 Standard (January)
Unknown
71.8%
1.8%
Llama 3.1 Nemotron 70B
Meta
22.2%
49.4%
1.8%
Qwen3 Omni 30B A3B
Qwen
15.5%
56.1%
1.8%
imagen-ultra-4.0-generate-001
Unknown
71.4%
1.8%
qwen3-30b-a3b-instruct-2507
Qwen
71.2%
1.8%
LongCat Flash Lite
Unknown
54.9%
15.9%
1.8%
Qwen3 VL 8B
Qwen
27.6%
43.1%
1.8%
kimi-k2-0905-preview
Unknown
69.9%
1.7%
PixVerse V4.5
Unknown
39.4%
30.2%
1.7%
Sarvam 30B (high)
Unknown
18.5%
51%
1.7%
hunyuan-turbos-20250416
Tencent
69.3%
1.7%
Olmo 3.1 32B Think
Unknown
41.5%
27.6%
1.7%
Qwen3 235B
Qwen
39.7%
29.3%
1.7%
Nova Premier
Unknown
44.1%
24.7%
1.7%
Claude Haiku 3
Anthropic
29.4%
10.9%
0%
3.2%
6.5%
0%
0%
0%
18.5%
1.7%
Gemma 3 4B
Google
0.7%
67.8%
1.7%
kimi-k2-0711-preview
Unknown
67.7%
1.7%
Llama Nemotron Ultra
Meta
31.3%
36%
1.7%
Seedance 1.0 Mini
Unknown
38%
28.6%
1.7%
qwen3-next-80b-a3b-thinking
Qwen
66.5%
1.7%
Veo 3 Fast Preview
Unknown
66.2%
1.7%
qwen-image-2512
Qwen
66.1%
1.7%
Gemma 3 27B
Google
11.1%
54.8%
1.6%
Wan 2.2 A14B Open Weights
Unknown
43.7%
22.2%
1.6%
wan2.5-t2v-preview
Unknown
65.8%
1.6%
wan2.6-i2v
Unknown
65.8%
1.6%
Hermes 3 - Llama-3.1 70B
Unknown
15.5%
50.2%
1.6%
qwen2.5-max
Qwen
65.5%
1.6%
amazon-nova-experimental-chat-10-09
Amazon
64.6%
1.6%
Hailuo 02 Fast
Unknown
18.4%
46%
1.6%
wan2.6-t2i
Unknown
64.3%
1.6%
nova-2-lite
Unknown
63.6%
1.6%
Hailuo 2.3 Fast
Unknown
63.5%
1.6%
Apriel-v1.5-15B-Thinker
Unknown
63.3%
1.6%
Apriel-v1.6-15B-Thinker
Unknown
63.3%
1.6%
DeepSeek V3.1
DeepSeek
63.3%
1.6%
Nova 2.0 Omni (medium)
Unknown
63.3%
1.6%
Reka Flash
Unknown
18.5%
44.8%
1.6%
Mistral Large 2 (Jul)
Mistral
40.8%
22.2%
1.6%
Mistral Nemo 12B
Mistral
63%
1.6%
imagen-4.0-generate-001
Unknown
62.5%
1.6%
Qwen3 8B
Qwen
15.5%
46.9%
1.6%
intellect-3
Unknown
62%
1.6%
Qwen2.5 Max
Qwen
36%
25.9%
1.5%
reve-v1.1
Unknown
60.5%
1.5%
qwen-vl-max-2025-08-13
Qwen
60.4%
1.5%
Qwen3 30B
Qwen
22.2%
38.1%
1.5%
Qwen3 1.7B
Qwen
3.7%
56.5%
1.5%
nvidia-nemotron-3-nano-30b-a3b-bf16
Unknown
60.1%
1.5%
Qwen3 0.6B
Qwen
0.7%
59.4%
1.5%
minimax-m1
Unknown
59.5%
1.5%
o1-pro
OpenAI
59.3%
1.5%
Vidu Q2
Unknown
59.2%
1.5%
nvidia-llama-3.3-nemotron-super-49b-v1.5
Unknown
57.9%
1.4%
kling-v2.1-master
Unknown
21.1%
36.8%
1.4%
Qwen2.5 Turbo
Qwen
18.5%
38.9%
1.4%
Qwen3 4B
Qwen
18.5%
38.9%
1.4%
wan2.5-t2i-preview
Unknown
57.1%
1.4%
gemma-3-12b-it
Google
57%
1.4%
flux-1-kontext-max
Unknown
37.5%
18.6%
1.4%
kling-image-o1
Unknown
55.8%
1.4%
command-a-03-2025
Unknown
55.4%
1.4%
glm-4-plus-0111
Zhipu
55.4%
1.4%
trinity-large-preview
Unknown
55.4%
1.4%
GPT Image 1 mini
OpenAI
48.2%
7%
1.4%
MiniMax M1 80k
Unknown
54.9%
1.4%
Qwen3 14B
Qwen
22.2%
32.2%
1.4%
qwen-plus-0125
Qwen
54.4%
1.4%
step-2-16k-exp-202412
StepFun
54.1%
1.4%
ERNIE 4.5 300B A47B
Unknown
31.3%
22.6%
1.3%
hunyuan-turbos-20250226
Tencent
53.5%
1.3%
llama-3.1-nemotron-ultra-253b-v1
Meta
52.8%
1.3%
amazon-nova-pro-v1.0
Amazon
38.6%
13.9%
1.3%
Qwen3 30B A3B
Qwen
52.2%
1.3%
Nova 2.0 Omni (low)
Unknown
52.2%
1.3%
Jamba 1.5 Large
Unknown
36.4%
15.5%
1.3%
hunyuan-turbo-0110
Tencent
51.6%
1.3%
llama-3.3-nemotron-49b-super-v1
Meta
51.3%
1.3%
qwen-image-edit
Qwen
51.2%
1.3%
gemma-3n-e4b-it
Google
50.9%
1.3%
Kling 1.6 Pro
Unknown
23.9%
27%
1.3%
yi-lightning
Unknown
50.3%
1.3%
recraft-v4
Unknown
50%
1.3%
qwen2.5-plus-1127
Qwen
49.7%
1.2%
olmo-3-32b-think
Unknown
49.4%
1.2%
amazon-nova-lite-v1.0
Amazon
34.5%
14.9%
1.2%
GPT-3.5 Turbo
OpenAI
14.2%
6.5%
0%
3.2%
6.5%
3.2%
6.5%
9.1%
1.2%
reve-v1
Unknown
48.8%
1.2%
deepseek-v2.5-1210
DeepSeek
48.7%
1.2%
flux-1-kontext-pro
Unknown
32.1%
16.3%
1.2%
athene-v2-chat
Unknown
48.1%
1.2%
gemma-3-4b-it
Google
48.1%
1.2%
glm-4-plus
Zhipu
47.8%
1.2%
hunyuan-video-1.5
Tencent
26.3%
21.1%
1.2%
hunyuan-large-2025-02-10
Tencent
47.2%
1.2%
qwen-image-edit-2511
Qwen
46.5%
1.2%
HunyuanVideo-1.5 (Fal) Open Weights
Tencent
22.5%
23.8%
1.2%
Marey
Unknown
35.2%
11.1%
1.2%
llama-3.1-405b-instruct-bf16
Meta
45.9%
1.1%
llama-3.1-nemotron-70b-instruct
Meta
45.6%
1.1%
flux-2-klein-4b
Unknown
21.4%
23.3%
1.1%
llama-3.1-405b-instruct-fp8
Meta
44.3%
1.1%
mercury
Unknown
44.3%
1.1%
qwen-max-0919
Qwen
44.3%
1.1%
wan2.6-image
Unknown
44.2%
1.1%
Gemma 3 12B
Google
9.1%
34.7%
1.1%
Sora
Unknown
7.9%
33.8%
1.6%
1.1%
Kling 2.1 Pro
Unknown
42.9%
1.1%
mai-image-1
Unknown
42.9%
1.1%
llama-3.3-70b-instruct
Meta
42.7%
1.1%
hunyuan-standard-2025-02-10
Tencent
42.4%
1.1%
deepseek-v2.5
DeepSeek
41.5%
1%
z-image-turbo
Unknown
41.1%
1%
athene-70b-0725
Unknown
40.2%
1%
mistral-large-2411
Mistral
40.2%
1%
kling-o3-pro
Unknown
40%
1%
Kling 2.1 Standard
Unknown
39.7%
1%
seedream-3
Unknown
39.3%
1%
llama-3.1-70b-instruct
Meta
38.9%
1%
llama-3.1-tulu-3-70b
Meta
38.3%
1%
magistral-medium-2506
Mistral
37.7%
0.9%
reka-core-20240904
Unknown
37.3%
0.9%
ibm-granite-h-small
Unknown
37%
0.9%
step-1o-vision-32k-highres
StepFun
36.6%
0.9%
Midjourney V1
Unknown
36.5%
0.9%
mistral-small-24b-instruct-2501
Mistral
36.1%
0.9%
Claude 3.5 Sonnet (Oct)
Anthropic
36%
0.9%
Qwen3.5 2B
Qwen
36%
0.9%
gemma-2-27b-it
Google
35.8%
0.9%
qwen2.5-vl-72b-instruct
Qwen
35.6%
0.9%
qwen2.5-coder-32b-instruct
Qwen
35.4%
0.9%
command-r-plus-08-2024
Cohere
35.1%
0.9%
qwen2.5-vl-32b-instruct
Qwen
34.7%
0.9%
llama-3.1-nemotron-51b-instruct
Meta
34.5%
0.9%
gemma-2-9b-it-simpo
Google
34.2%
0.9%
Vidu Q1
Unknown
19.7%
14.3%
0.9%
qwen-image-prompt-extend
Qwen
33.9%
0.8%
Motion 2.0
Unknown
21.1%
12.7%
0.8%
Jamba 1.5 Mini
Unknown
27.8%
5.7%
0.8%
glm-4-0520
Zhipu
33.5%
0.8%
nemotron-4-340b-instruct
Unknown
33.2%
0.8%
c4ai-aya-expanse-32b
Unknown
32.9%
0.8%
llama-3-70b-instruct
Meta
32.6%
0.8%
Kling 1.5 Pro
Unknown
32.4%
0.8%
Vivago 2.0
Unknown
31.7%
0.8%
olmo-2-0325-32b-instruct
Unknown
31.6%
0.8%
reka-flash-20240904
Unknown
31.6%
0.8%
kling-v2.1-standard
Unknown
31.6%
0.8%
wan-v2.2-a14b
Unknown
18.4%
13.2%
0.8%
amazon-nova-micro-v1.0
Amazon
31%
0.8%
T2V-01-Director
Unknown
31%
0.8%
gemma-2-9b-it
Google
30.7%
0.8%
Granite 4.0 H Small
Unknown
15.5%
15.1%
0.8%
command-r-plus
Cohere
30.4%
0.8%
imagen-3.0-generate-002
Unknown
30.4%
0.8%
Wan 2.1 14B Open Weights
Unknown
23.9%
6.3%
0.8%
p-image-edit
Unknown
30.2%
0.8%
qwen2-72b-instruct
Qwen
30.1%
0.8%
hunyuan-standard-256k
Tencent
29.7%
0.7%
Pika 2.0
Unknown
29.6%
0.7%
llama-3.1-tulu-3-8b
Meta
29.1%
0.7%
kandinsky-5.0-t2v-pro
Unknown
28.9%
0.7%
qwen-image
Qwen
28.6%
0.7%
deepseek-coder-v2
DeepSeek
28.5%
0.7%
ministral-8b-2410
Mistral
28.5%
0.7%
T2V-01
Unknown
28.2%
0.7%
reve-v1.1-fast
Unknown
27.9%
0.7%
command-r-08-2024
Cohere
27.8%
0.7%
Claude 3.5 Sonnet (June)
Anthropic
27.6%
0.7%
llama-3.1-8b-instruct
Meta
27.2%
0.7%
c4ai-aya-expanse-8b
Unknown
26.9%
0.7%
ideogram-v3-quality
Unknown
26.8%
0.7%
mistral-large-2402
Mistral
26.6%
0.7%
seedance-v1-lite
Unknown
10.5%
15.8%
0.7%
qwen1.5-110b-chat
Qwen
26.3%
0.7%
yi-1.5-34b-chat
Unknown
25.9%
0.6%
ppl-sonar-reasoning-pro-high
Unknown
25.9%
0.6%
qwen-vl-max-1119
Qwen
25.7%
0.6%
qwen2-vl-72b
Qwen
25.7%
0.6%
reka-flash-21b-20240226-online
Unknown
25.6%
0.6%
reve-edit-fast
Unknown
25.6%
0.6%
qwen1.5-72b-chat
Qwen
25.3%
0.6%
p-image
Unknown
25%
0.6%
llama-3-8b-instruct
Meta
24.4%
0.6%
reka-flash-21b-20240226
Unknown
24.4%
0.6%
command-r
Cohere
24.1%
0.6%
Kling 1.6 Standard
Unknown
23.9%
0.6%
step-1v-32k
StepFun
23.8%
0.6%
ltx-2-19b
Unknown
15.8%
7.9%
0.6%
mixtral-8x22b-instruct-v0.1
Unknown
23.4%
0.6%
qwq-32b-preview
Unknown
23.4%
0.6%
photon
Unknown
23.2%
0.6%
internlm2_5-20b-chat
Unknown
23.1%
0.6%
gemma-2-2b-it
Google
22.8%
0.6%
granite-3.1-8b-instruct
Unknown
22.5%
0.6%
Nova Pro
Unknown
22.2%
0.6%
zephyr-orpo-141b-A35b-v0.1
Unknown
22.2%
0.6%
phi-3-medium-4k-instruct
Unknown
21.5%
0.5%
qwen1.5-32b-chat
Qwen
21.5%
0.5%
starling-lm-7b-beta
Unknown
21.2%
0.5%
wan2.5-i2i-preview
Unknown
20.9%
0.5%
mixtral-8x7b-instruct-v0.1
Unknown
20.9%
0.5%
molmo-72b-0924
Unknown
20.8%
0.5%
Runway Gen 3 Alpha
Unknown
15.5%
4.8%
0.5%
qwen1.5-14b-chat
Qwen
19.9%
0.5%
yi-34b-chat
Unknown
19.9%
0.5%
hunyuan-standard-vision-2024-12-31
Tencent
19.8%
0.5%
runway-gen4
Unknown
19.6%
0.5%
granite-3.1-2b-instruct
Unknown
19.6%
0.5%
tulu-2-dpo-70b
Unknown
19.3%
0.5%
Runway Gen 4
Unknown
19%
0.5%
llama-3.2-vision-90b-instruct
Meta
18.8%
0.5%
dbrx-instruct-preview
Unknown
18.7%
0.5%
wizardlm-70b
Unknown
18.7%
0.5%
Solar Mini
Unknown
18.5%
0.5%
llama-2-70b-chat
Meta
18.4%
0.5%
Mochi 1 Open Weights
Unknown
18.3%
0.5%
nous-hermes-2-mixtral-8x7b-dpo
Unknown
18%
0.5%
recraft-v3
Unknown
17.9%
0.4%
qwen2-vl-7b-instruct
Qwen
17.8%
0.4%
phi-3-small-8k-instruct
Unknown
17.7%
0.4%
flux-1-kontext-dev
Unknown
3.6%
14%
0.4%
I2V-01-Director
Unknown
17.5%
0.4%
llama-3.2-3b-instruct
Meta
17.4%
0.4%
starling-lm-7b-alpha
Unknown
17.1%
0.4%
Hunyuan Video (Fal) Open Weights
Tencent
16.9%
0%
0.4%
pixtral-12b-2409
Unknown
16.8%
0.4%
openchat-3.5-0106
Unknown
16.8%
0.4%
Pika 2.2
Unknown
7%
9.5%
0.4%
deepseek-llm-67b-chat
DeepSeek
16.1%
0.4%
vicuna-33b
Unknown
16.1%
0.4%
flux-1.1-pro
Unknown
16.1%
0.4%
LTX Video v0.9.7 13B Open Weights
Unknown
15.9%
0.4%
internvl2-26b
Unknown
15.8%
0.4%
snowflake-arctic-instruct
Unknown
15.8%
0.4%
llama2-70b-steerlm-chat
Unknown
15.5%
0.4%
Qwen3.5 0.8B
Qwen
15.5%
0.4%
openchat-3.5
Unknown
15.2%
0.4%
granite-3.0-8b-instruct
Unknown
14.9%
0.4%
gemma-1.1-7b-it
Google
14.2%
0.4%
Ray 1
Unknown
14.1%
0.4%
openhermes-2.5-mistral-7b
Unknown
13.9%
0.3%
mistral-7b-instruct-v0.2
Mistral
13.6%
0.3%
llama-2-13b-chat
Meta
13.3%
0.3%
kandinsky-5.0-t2v-lite
Unknown
13.2%
0.3%
Wan 2.2 5B Open Weights
Unknown
9.9%
3.2%
0.3%
Krea Realtime Open Weights
Unknown
12.7%
0.3%
qwen1.5-7b-chat
Qwen
12.7%
0.3%
solar-10.7b-instruct-v1.0
Unknown
12.7%
0.3%
ideogram-v2
Unknown
12.5%
0.3%
lucid-origin
Unknown
12.5%
0.3%
dolphin-2.2.1-mistral-7b
Unknown
12.3%
0.3%
yi-vision
Unknown
11.9%
0.3%
granite-3.0-2b-instruct
Unknown
11.7%
0.3%
phi-3-mini-4k-instruct-june-2024
Unknown
11.7%
0.3%
seededit-3.0
Unknown
11.6%
0.3%
wizardlm-13b
Unknown
11.4%
0.3%
Llama 2 Chat 7B
Meta
11.1%
0.3%
Mistral Large (Feb)
Mistral
11.1%
0.3%
ppl-sonar-pro-high
Unknown
11.1%
0.3%
Reka Flash 3
Unknown
11.1%
0.3%
phi-3-mini-4k-instruct
Unknown
11.1%
0.3%
zephyr-7b-beta
Unknown
10.8%
0.3%
glm-image
Zhipu
10.7%
0.3%
ray2
Unknown
5.3%
5.3%
0.3%
mpt-30b-chat
Unknown
10.4%
0.3%
codellama-34b-instruct
Unknown
10.1%
0.3%
c4ai-aya-vision-32b
Unknown
9.9%
0.2%
Kling 1.0
Unknown
9.9%
0.2%
Granite 3.3 8B
Unknown
3.7%
5.9%
0.2%
vicuna-13b
Unknown
9.5%
0.2%
zephyr-7b-alpha
Unknown
9.5%
0.2%
codellama-70b-instruct
Unknown
9.2%
0.2%
Llama 3 70B
Meta
9.1%
0.2%
molmo-7b-d-0924
Unknown
8.9%
0.2%
gemma-7b-it
Google
8.9%
0.2%
llama-3.2-1b-instruct
Meta
8.5%
0.2%
falcon-180b-chat
Unknown
8.2%
0.2%
Runway Gen 3 Alpha Turbo
Unknown
7.9%
0.2%
llama-3.2-vision-11b-instruct
Meta
7.9%
0.2%
guanaco-33b
Unknown
7.6%
0.2%
llama-2-7b-chat
Meta
7.6%
0.2%
qwen-14b-chat
Qwen
7.3%
0.2%
flux-1-dev-fp8
Unknown
7.1%
0.2%
Ray 2
Unknown
7%
0.2%
phi-3-mini-128k-instruct
Unknown
7%
0.2%
nvila-internal-15b-v1
Unknown
6.9%
0.2%
smollm2-1.7b-instruct
Unknown
6.6%
0.2%
stripedhyena-nous-7b
Unknown
6.3%
0.2%
olmo-7b-instruct
Unknown
6%
0.2%
llava-onevision-qwen2-72b-ov
Unknown
5.9%
0.1%
Apertus 70B Instruct
Unknown
5.7%
0.1%
Command-R+ (Apr)
Cohere
5.7%
0.1%
LFM2 2.6B
Unknown
5.7%
0.1%
LFM2.5-1.2B-Instruct
Unknown
5.7%
0.1%
Mixtral 8x7B
Unknown
5.7%
0.1%
Olmo 3 7B
Unknown
5.7%
0.1%
Sarvam M
Unknown
5.7%
0.1%
vicuna-7b
Unknown
5.7%
0.1%
Haiper 2.0
Unknown
5.6%
0.1%
palm-2
Unknown
5.4%
0.1%
dall-e-3
Unknown
5.4%
0.1%
llava-v1.6-34b
Unknown
5%
0.1%
gemma-1.1-2b-it
Google
4.7%
0.1%
gemma-2b-it
Google
4.4%
0.1%
Pika 1.5
Unknown
4.2%
0.1%
qwen1.5-4b-chat
Qwen
4.1%
0.1%
koala-13b
Unknown
3.8%
0.1%
Command-R (Mar)
Cohere
3.7%
0.1%
diffbot-small-xl
Unknown
3.7%
0.1%
LFM2 8B A1B
Unknown
3.7%
0.1%
Molmo2-8B
Unknown
3.7%
0.1%
chatglm3-6b
Unknown
3.5%
0.1%
gpt4all-13b-snoozy
Unknown
3.2%
0.1%
cogvlm2-llama3-chat-19b
Unknown
3%
0.1%
minicpm-v-2_6
Unknown
3%
0.1%
mpt-7b-chat
Unknown
2.8%
0.1%
Step-Video-T2V
StepFun
2.8%
0.1%
pika-v2.2
Unknown
2.6%
0%
0.1%
runway-gen4-turbo
Unknown
2.6%
0.1%
RWKV-4-Raven-14B
Unknown
2.5%
0.1%
bagel
Unknown
0%
2.3%
0.1%
chatglm2-6b
Unknown
2.2%
0.1%
internvl2-4b
Unknown
2%
0%
alpaca-13b
Unknown
1.9%
0%
stable-diffusion-v35-large
Unknown
1.8%
0%
chatglm-6b
Unknown
1.6%
0%
CogVideoX-5B Open Weights
Unknown
1.4%
0%
oasst-pythia-12b
Unknown
1.3%
0%
phi-3.5-vision-instruct
Unknown
1%
0%
fastchat-t5-3b
Unknown
0.9%
0%
Apertus 8B Instruct
Unknown
0.7%
0%
LFM2 1.2B
Unknown
0.7%
0%
LFM2.5-VL-1.6B
Unknown
0.7%
0%
Llama 3 8B
Meta
0.7%
0%
stablelm-tuned-alpha-7b
Unknown
0.6%
0%
dolly-v2-12b
Unknown
0.3%
0%
acm_rewrite_qwen2-72B-Chat
Unknown
0%
azerogpt
Unknown
0%
codegen3_5k-qwen2.5-72b-instruct-2-chk-50
Unknown
0%
Codestral
Mistral
0%
Codestral Embed
Mistral
0%
coding-meta-llama-3.1-70b-instruct-chk-50
Unknown
0%
coding2-amcfull-apifull-mmlu12k-meta-llama-3.1-70b-instruct-chk-150
Unknown
0%
deepseek-coder
DeepSeek
0%
DeepSeek-Coder-V2-Lite-Instruct
DeepSeek
0%
deepseek-r1-distill-qwen-32b
DeepSeek
0%
deepseek-r1-local
DeepSeek
0%
deepseek-r1-local-2
DeepSeek
0%
DeepSeek-V2-Lite-Chat
DeepSeek
0%
devstral-medium-2507
Mistral
0%
0%
0%
dracarys2-72b-instruct
Unknown
0%
dracarys2-llama-3.1-70b-instruct
Unknown
0%
FLUX.2 [max]
Unknown
0%
0%
Gemma 3n E2B
Google
0%
0%
GPT Realtime 1.5
OpenAI
0%
GPT Realtime mini
OpenAI
0%
GPT-4o mini Transcribe
OpenAI
0%
GPT-4o mini TTS
OpenAI
0%
GPT-4o Transcribe
OpenAI
0%
hunyuan-turbos-20250313
Tencent
0%
HunyuanImage 3.0 Instruct (Fal)
Tencent
0%
0%
lcb-math-qwen2-72b-instructv3-merged-50
Unknown
0%
Leanstral
Mistral
0%
learnlm-1.5-pro-experimental
Google
0%
Llama 4 Behemoth
Meta
0%
Llama-2-7b-chat-hf
Meta
0%
Llama-3.1-Nemotron-70B-Instruct-HF
Meta
0%
llama-3.3-70b-instruct-turbo
Meta
0%
llama-13b
Meta
0%
0%
mathstral-7B-v0.1
Unknown
0%
Meta-Llama-3-8B-Instruct
Meta
0%
Meta-Llama-3-70B-Instruct
Meta
0%
meta-llama-3.1-8b-instruct-turbo
Meta
0%
meta-llama-3.1-70b-instruct-turbo
Meta
0%
meta-llama-3.1-405b-instruct-turbo
Meta
0%
Mistral Embed
Mistral
0%
Mistral Moderation 2
Mistral
0%
Mistral-7B-Instruct-v0.3
Mistral
0%
mistral-small-2409
Mistral
0%
mistral-small-2501
Mistral
0%
mistral-small-2503
Mistral
0%
mochi-v1
Unknown
0%
0%
OCR 3
Mistral
0%
olmo-2-1124-13b-instruct
Unknown
0%
open-mistral-nemo
Unknown
0%
open-mixtral-8x7b
Unknown
0%
open-mixtral-8x22b
Unknown
0%
perplexity-sonar-reasoning
Unknown
0%
Phi-3-medium-128k-instruct
Unknown
0%
Phi-3-small-128k-instruct
Unknown
0%
phi-3-vision-128k-instruct
Unknown
0%
0%
Phi-3.5-mini-instruct
Unknown
0%
Phi-3.5-MoE-instruct
Unknown
0%
Pyramid Flow Open Weights
Unknown
0%
0%
Qwen1.5-0.5B-Chat
Qwen
0%
Qwen1.5-1.8B-Chat
Qwen
0%
Qwen2-0.5B-Instruct
Qwen
0%
Qwen2-1.5B-Instruct
Qwen
0%
Qwen2-7B-Instruct
Qwen
0%
qwen2-math-72b-instruct
Qwen
0%
Qwen2.5-7B-Instruct-Turbo
Qwen
0%
Qwen2.5-72B-Instruct-Turbo
Qwen
0%
Qwen3-Coder Plus
Qwen
0%
Qwen3.5 Plus
Qwen
0%
Reflection-Llama-3.1-70B
Unknown
0%
runway-gen4-aleph
Unknown
0%
0%
sky-t1-32b-preview
Unknown
0%
Smaug-Qwen2-72B-Instruct
Unknown
0%
sonar
Unknown
0%
sonar-pro
Unknown
0%
step-2-16k-202411
StepFun
0%
step1x-edit
Unknown
0%
0%
Tiny Aya Global
Unknown
0%
0%
vicuna-7b-v1.5
Unknown
0%
vicuna-7b-v1.5-16k
Unknown
0%
Voxtral Mini Transcribe 2
Mistral
0%
Voxtral Mini Transcribe Realtime
Mistral
0%
Voxtral Small
Mistral
0%
Voxtral TTS
Mistral
0%
wbot-4:347b_no_s
Unknown
0%
Yi-6B-Chat
Unknown
0%
weakerstronger percentile inside group