UAB · Unbiased AI Bench · Glass box for model evals.
Every leaderboard, with receipts.
Image Edit Arena
Live · updated continuously
Benchmarks · /benchmarks/arena-image-edit

Blind preference arena for instruction-based image editing quality.
Source · Arena
Version · arena snapshot 2026-05-01
Scores · 44

Passport

Visible tradeoffs
This is a human preference signal, so it tells you what people liked side by side, not what is formally correct.
source · Arena
metric · Arena rating (rating)
judge · Human
direction · higher better
group id · arena_image_edit_2026_q2
domain · Image editing
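
The passport describes the metric as an Arena rating built from human side-by-side votes, where higher is better. The page does not state how a rating gap maps to a preference rate. As a rough sketch only, assuming an Elo-style logistic scale with a 400-point base-10 spread (an assumption, not something this page confirms), a gap between two ratings can be read as an expected preference rate. The function name `expected_preference` and the `scale` parameter are hypothetical.

```python
def expected_preference(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected share of votes preferring A over B.

    Assumes an Elo-style logistic model; the scale of 400 is an assumption,
    not a value published on this page.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Ratings of the #1 and #2 entries from the leaderboard on this page.
p = expected_preference(1510, 1393)
print(f"Expected preference for #1 over #2: {p:.2f}")
```

Under that assumed scale, equal ratings give 0.5, and the value depends only on the rating difference, which is why ratings are meaningful only relative to other models in the same group.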

What it measures vs what it misses

✓ Measures

Observed user preference when models edit the same source image under the same instruction. How well edits preserve the instruction's intent while producing outputs that voters visually prefer.

✗ Misses

Pixel-level preservation fidelity. Objective adherence scoring for localized edits or safety policies.

Why this counts
It would matter for editing and instruction-following once verified public receipts exist in the catalog.

Comparable-group rule
This percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.

What it misses
This slice is currently limited because the product does not yet carry first-class image-editing receipts.
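
The comparable-group rule can be illustrated with a small sketch. The page does not give its percentile formula, so this assumes the percentile is the share of models in the same group with a strictly lower rating. The record layout, the function name `group_percentile`, and the second group with its model are hypothetical and exist only to show that scores from other groups are ignored.

```python
from typing import Dict, List, Union

Record = Dict[str, Union[str, float]]


def group_percentile(records: List[Record], model: str, group_id: str) -> float:
    """Percent of models in `group_id` rated strictly below `model`.

    Assumed definition; the site's actual percentile formula is not stated.
    """
    # Only scores from the exact benchmark/version group are comparable.
    scores = {r["model"]: r["rating"] for r in records if r["group_id"] == group_id}
    if model not in scores:
        raise KeyError(f"{model!r} has no score in group {group_id!r}")
    target = scores[model]
    below = sum(1 for s in scores.values() if s < target)
    return 100.0 * below / len(scores)


GROUP = "arena_image_edit_2026_q2"
records: List[Record] = [
    # Three entries taken from the leaderboard on this page.
    {"model": "GPT Image 2 (high)", "rating": 1510, "group_id": GROUP},
    {"model": "seedream-4.5", "rating": 1304, "group_id": GROUP},
    {"model": "step1x-edit", "rating": 1000, "group_id": GROUP},
    # Hypothetical record from another group; it must not affect the result.
    {"model": "hypothetical-model", "rating": 2000, "group_id": "other_group_v1"},
]

print(group_percentile(records, "seedream-4.5", GROUP))
```

Filtering by group id before ranking is the whole point of the rule: a rating from a different benchmark or snapshot never enters the comparison, however high it is.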

Leaderboard · this benchmark version

#1 · GPT Image 2 (high) · AR · May 1, 2026 · 1,510
#2 · chatgpt-image-latest-high-fidelity (20251216) · AR · May 1, 2026 · 1,393
#3 · gemini-3-pro-image-preview-2k (nano-banana-pro) · AR · May 1, 2026 · 1,389
#4 · Gemini 3 Pro Image Preview · AR · May 1, 2026 · 1,387
#5 · gemini-3.1-flash-image-preview (nano-banana-2) [web-search] · AR · May 1, 2026 · 1,387
#6 · gpt-image-1.5-high-fidelity · AR · May 1, 2026 · 1,376
#7 · Grok Imagine Image Pro · AR · May 1, 2026 · 1,316
#8 · Grok Imagine Image · AR · May 1, 2026 · 1,312
#9 · seedream-4.5 · AR · May 1, 2026 · 1,304
#10 · wan2.7-image-pro · AR · May 1, 2026 · 1,304
#11 · wan2.7-image · AR · May 1, 2026 · 1,303
#12 · gemini-2.5-flash-image-preview (nano-banana) · AR · May 1, 2026 · 1,300
#13 · hunyuan-image-3.0-instruct · AR · May 1, 2026 · 1,299
#14 · seedream-5.0-lite · AR · May 1, 2026 · 1,292
#15 · seedream-4-2k · AR · May 1, 2026 · 1,274
#16 · qwen-image-2.0-pro-2026-04-22 · AR · May 1, 2026 · 1,272
#17 · flux-2-max · AR · May 1, 2026 · 1,263
#18 · reve-v1.1 · AR · May 1, 2026 · 1,261
#19 · qwen-image-2.0-2026-03-03 · AR · May 1, 2026 · 1,257
#20 · kling-image-o1 · AR · May 1, 2026 · 1,256
#21 · flux-2-pro · AR · May 1, 2026 · 1,241
#22 · qwen-image-edit · AR · May 1, 2026 · 1,239
#23 · reve-v1 · AR · May 1, 2026 · 1,237
#24 · qwen-image-edit-2511 · AR · May 1, 2026 · 1,234
#25 · wan2.6-image · AR · May 1, 2026 · 1,230
#26 · flux-2-flex · AR · May 1, 2026 · 1,226
#27 · flux-2-dev · AR · May 1, 2026 · 1,226
#28 · flux-2-klein-9b · AR · May 1, 2026 · 1,225
#29 · seedream-4-high-res-fal · AR · May 1, 2026 · 1,224
#30 · seedream-4-fal · AR · May 1, 2026 · 1,213
#31 · p-image-edit · AR · May 1, 2026 · 1,210
#32 · reve-v1.1-fast · AR · May 1, 2026 · 1,209
#33 · reve-edit-fast · AR · May 1, 2026 · 1,200
#34 · flux-2-klein-4b · AR · May 1, 2026 · 1,190
#35 · wan2.5-i2i-preview · AR · May 1, 2026 · 1,184
#36 · flux-1-kontext-max · AR · May 1, 2026 · 1,183
#37 · flux-1-kontext-pro · AR · May 1, 2026 · 1,178
#38 · flux-1-kontext-dev · AR · May 1, 2026 · 1,152
#39 · seededit-3.0 · AR · May 1, 2026 · 1,141
#40 · GPT Image 1.5 · AR · May 1, 2026 · 1,140
#41 · GPT Image 1 mini · AR · May 1, 2026 · 1,125
#42 · gemini-2.0-flash-preview-image-generation · AR · May 1, 2026 · 1,083
#43 · bagel · AR · May 1, 2026 · 1,028
#44 · step1x-edit · AR · May 1, 2026 · 1,000