Unbiased AI Bench (UAB)
Glass box for model evals.
Every leaderboard, with receipts.

Time to first token

Latency benchmark: time to first token (TTFT), the seconds between sending a request and receiving the first token of the response.
Source · Artificial Analysis
Version · artificial-analysis snapshot 2026-05-01
Scores · 240

Passport

Visible tradeoffs

This is an efficiency signal, so it belongs beside quality rather than being mistaken for quality.

source · Artificial Analysis
metric · TTFT (s)
judge · Speed / cost
direction · lower is better
group id · aa_ttft_current
domain · Chat / text

What it measures vs what it misses

✓ Measures

Initial response latency.

✗ Misses

Output quality.
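
As a rough illustration of what "initial response latency" means, the sketch below times the gap between issuing a request and receiving the first non-empty chunk of a streamed reply. It is a generic definition of TTFT, not Artificial Analysis's measurement harness, whose methodology is not described on this page. `time_to_first_token` and `fake_stream` are hypothetical names, and the simulated delay stands in for a real streaming API call.

```python
import time
from typing import Callable, Iterable, Iterator, Optional


def time_to_first_token(start_stream: Callable[[], Iterable[str]]) -> Optional[float]:
    """Return seconds from issuing the request to the first non-empty chunk."""
    started = time.perf_counter()
    for chunk in start_stream():
        if chunk:  # ignore empty keep-alive chunks
            return time.perf_counter() - started
    return None  # the stream ended without producing any token


def fake_stream() -> Iterator[str]:
    """Hypothetical stand-in for a streaming model response."""
    time.sleep(0.05)  # simulated queueing and prompt-processing delay
    yield "Hello"
    time.sleep(0.05)  # later chunks do not affect TTFT
    yield " world"


ttft = time_to_first_token(fake_stream)
print(f"TTFT: {ttft:.2f}s")
```

Only the wait before the first chunk is counted, so a model can score well here while still being slow to finish its answer or producing a poor one.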

Why this counts

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Comparable-group rule

This percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.

What it misses

It does not prove deeper reasoning, tool use, or enterprise workflow reliability.
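
The comparable-group rule can be sketched as a percentile computed only over the members of one group, with the direction flipped because lower TTFT is better. The page does not state the site's actual percentile formula, so the tie handling and the function name `percentile_in_group` are assumptions. The sample uses only the first five leaderboard rows, so its percentiles are illustrative rather than the published ones.

```python
from typing import Dict


def percentile_in_group(
    scores: Dict[str, float], model: str, lower_is_better: bool = True
) -> float:
    """Percent of the other group members that `model` beats; ties count as half."""
    own = scores[model]
    others = [value for name, value in scores.items() if name != model]
    if not others:
        return 100.0  # a single-member group has nothing to compare against
    if lower_is_better:
        beaten = sum(value > own for value in others)
    else:
        beaten = sum(value < own for value in others)
    ties = sum(value == own for value in others)
    return 100.0 * (beaten + 0.5 * ties) / len(others)


# Subset of group "aa_ttft_current": TTFT in seconds for leaderboard rows 1 to 5.
group = {
    "NVIDIA Nemotron 3 Nano": 0.40,
    "Ministral 3 3B": 0.47,
    "Qwen3.5 0.8B": 0.52,
    "LFM2 24B A2B": 0.55,
    "Mistral 7B": 0.57,
}

for name in group:
    print(f"{name}: {percentile_in_group(group, name):.1f}")
```

Because the ranking is relative to the group passed in, the same TTFT value would land at a different percentile in another benchmark or snapshot, which is why the score should not be read as universal.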

Leaderboard · this benchmark version

#1 · NVIDIA Nemotron 3 Nano
AA · May 1, 2026
0.40s
#2 · Ministral 3 3B
AA · May 1, 2026
0.47s
#3 · Qwen3.5 0.8B
AA · May 1, 2026
0.52s
#4 · LFM2 24B A2B
AA · May 1, 2026
0.55s
#5 · Mistral 7B
AA · May 1, 2026
0.57s
#6 · Grok 3 mini Reasoning (high)
AA · May 1, 2026
0.58s
#7 · Ministral 3 8B
AA · May 1, 2026
0.58s
#8 · Devstral Small
AA · May 1, 2026
0.65s
#9 · NVIDIA Nemotron Nano 9B V2
AA · May 1, 2026
0.65s
#10 · Mistral Small 3.2
AA · May 1, 2026
0.66s
#11 · Qwen3.5 9B
AA · May 1, 2026
0.66s
#12 · GPT-4.1 nano
AA · May 1, 2026
0.69s
#13 · Ministral 3 14B
AA · May 1, 2026
0.71s
#14 · GPT-OSS 20B
AA · May 1, 2026
0.72s
#15 · Grok 4.20 0309
AA · May 1, 2026
0.73s
#16 · Granite 4.1 8B
AA · May 1, 2026
0.76s
#17 · Claude Haiku 4.5
AA · May 1, 2026
0.77s
#18 · Magistral Small 1.2
AA · Apr 19, 2026
0.81s
#19 · Gemma 3 1B
AA · May 1, 2026
0.82s
#20 · Phi-4 Mini
AA · May 1, 2026
0.82s
#21 · Llama 3.2 11B (Vision)
AA · May 1, 2026
0.83s
#22 · Cogito v2.1
AA · May 1, 2026
0.84s
#23 · Llama 4 Scout
AA · May 1, 2026
0.85s
#24 · Phi-4 Multimodal
AA · May 1, 2026
0.85s
#25 · Mistral Small 3
AA · May 1, 2026
0.87s
#26 · Trinity Large Thinking
AA · May 1, 2026
0.87s
#27 · Llama 3.1 8B
AA · May 1, 2026
0.88s
#28 · Qwen3.5 4B
AA · May 1, 2026
0.88s
#29 · Mistral Small 4
AA · May 1, 2026
0.89s
#30 · GPT-OSS 120B
AA · May 1, 2026
0.90s
#31 · Jamba 1.6 Mini
AA · May 1, 2026
0.91s
#32 · Qwen3.5 2B
AA · May 1, 2026
0.91s
#33 · Llama 3.2 1B
AA · May 1, 2026
0.94s
#34 · Nova Micro
AA · May 1, 2026
0.94s
#35 · GPT-4.1 mini
AA · May 1, 2026
0.95s
#36 · Mistral Small (Sep)
AA · May 1, 2026
0.96s
#37 · Mistral Small 3.1
AA · May 1, 2026
0.97s
#38 · Qwen3.5 4B
AA · May 1, 2026
0.97s
#39 · Nova Lite
AA · May 1, 2026
1.00s
#40 · Mistral Small (Feb)
AA · May 1, 2026
1.01s
#41 · Devstral 2
AA · May 1, 2026
1.03s
#42 · Qwen3 Coder Next
AA · May 1, 2026
1.05s
#43 · Llama 4 Maverick
AA · May 1, 2026
1.06s
#44 · Mistral Large 3
AA · May 1, 2026
1.09s
#45 · NVIDIA Nemotron Nano 9B V2
AA · May 1, 2026
1.09s
#46 · Llama 3.2 3B
AA · May 1, 2026
1.10s
#47 · NVIDIA Nemotron Nano 12B v2 VL
AA · May 1, 2026
1.14s
#48 · NVIDIA Nemotron Nano 12B v2 VL
AA · May 1, 2026
1.14s
#49 · DeepSeek V4 Flash (Max)
AA · May 1, 2026
1.16s
#50 · DeepSeek V4 Flash
AA · May 1, 2026
1.18s
#51 · Nova 2.0 Lite
AA · May 1, 2026
1.18s
#52 · Nova 2.0 Pro Preview
AA · May 1, 2026
1.18s
#53 · Llama 3.2 90B (Vision)
AA · May 1, 2026
1.21s
#54 · NVIDIA Nemotron 3 Nano
AA · May 1, 2026
1.21s
#55 · Claude Sonnet 4
AA · May 1, 2026
1.22s
#56 · Step 3.5 Flash 2603
AA · May 1, 2026
1.23s
#57 · GPT-4.1
AA · May 1, 2026
1.24s
#58 · Gemma 3n E4B
AA · May 1, 2026
1.26s
#59 · Llama Nemotron Super 49B v1.5
AA · May 1, 2026
1.26s
#60 · GLM-4.7-Flash
AA · May 1, 2026
1.28s
#61 · GLM-4.7
AA · May 1, 2026
1.30s
#62 · Llama Nemotron Super 49B v1.5
AA · May 1, 2026
1.30s
#63 · Claude Sonnet 4.5
AA · May 1, 2026
1.31s
#64 · Mistral Medium 3
AA · May 1, 2026
1.32s
#65 · Devstral Small 2
AA · May 1, 2026
1.34s
#66 · Devstral Medium
AA · May 1, 2026
1.36s
#67 · GPT-4o
AA · May 1, 2026
1.40s
#68 · GLM-4.6
AA · May 1, 2026
1.40s
#69 · GLM-4.7
AA · May 1, 2026
1.40s
#70 · Hermes 4 70B
AA · May 1, 2026
1.41s
#71 · Hermes 4 70B
AA · May 1, 2026
1.41s
#72 · Nova 2.0 Omni
AA · May 1, 2026
1.42s
#73 · NVIDIA Nemotron 3 Super
AA · May 1, 2026
1.42s
#74 · Step 3.5 Flash
AA · May 1, 2026
1.42s
#75 · GLM-5.1
AA · May 1, 2026
1.43s
#76 · GLM-4.6
AA · May 1, 2026
1.44s
#77 · Jamba 1.6 Large
AA · May 1, 2026
1.44s
#78 · Gemma 3 4B
AA · May 1, 2026
1.47s
#79 · KAT-Coder-Pro V1
AA · May 1, 2026
1.49s
#80 · Olmo 3.1 32B Instruct
AA · May 1, 2026
1.50s
#81 · Llama 3.3 70B
AA · May 1, 2026
1.51s
#82 · GLM-4.7-Flash
AA · May 1, 2026
1.54s
#83 · Jamba 1.7 Large
AA · May 1, 2026
1.55s
#84 · Ling 2.6 Flash
AA · May 1, 2026
1.56s
#85 · Claude Opus 4.1
AA · May 1, 2026
1.57s
#86 · Claude Opus 4
AA · May 1, 2026
1.60s
#87 · Mistral Medium 3.5
AA · May 1, 2026
1.62s
#88 · Magistral Medium 1.2
AA · Apr 19, 2026
1.65s
#89 · Kimi K2
AA · May 1, 2026
1.66s
#90 · MiniMax-M2
AA · May 1, 2026
1.67s
#91 · Mistral Medium
AA · May 1, 2026
1.67s
#92 · Pixtral Large
AA · May 1, 2026
1.69s
#93 · Gemma 4 31B
AA · May 1, 2026
1.71s
#94 · Mistral Large 2 (Nov)
AA · May 1, 2026
1.72s
#95 · Grok 3
AA · May 1, 2026
1.74s
#96 · Kimi K2 Thinking
AA · May 1, 2026
1.75s
#97 · DeepSeek V4 Pro (High)
AA · May 1, 2026
1.76s
#98 · Qwen3 0.6B
AA · May 1, 2026
1.78s
#99 · MiniMax-M2.1
AA · May 1, 2026
1.82s
#100 · MiniMax-M2.5
AA · May 1, 2026
1.82s
#101 · Mistral Medium 3.1
AA · May 1, 2026
1.82s
#102 · GLM-5
AA · May 1, 2026
1.83s
#103 · Llama 3.1 70B
AA · May 1, 2026
1.85s
#104 · DeepSeek R1 Distill Llama 70B
AA · May 1, 2026
1.88s
#105 · Qwen3 1.7B
AA · May 1, 2026
1.89s
#106 · Qwen3 Omni 30B A3B
AA · May 1, 2026
1.91s
#107 · DeepSeek V4 Pro (Max)
AA · May 1, 2026
1.95s
#108 · GLM-5.1
AA · May 1, 2026
1.95s
#109 · Gemma 3 27B
AA · May 1, 2026
1.97s
#110 · MiniMax-M2.7
AA · May 1, 2026
1.98s
#111 · Command A
AA · May 1, 2026
1.99s
#112 · DeepSeek V4 Pro
AA · May 1, 2026
2.00s
#113 · Qwen3.5 Omni Flash
AA · May 1, 2026
2.00s
#114 · Qwen3 Omni 30B A3B
AA · May 1, 2026
2.02s
#115 · QwQ-32B
AA · May 1, 2026
2.03s
#116 · MiMo-V2-Omni
AA · May 1, 2026
2.04s
#117 · Qwen3 30B A3B 2507
AA · May 1, 2026
2.05s
#118 · Sarvam 30B (high)
AA · May 1, 2026
2.06s
#119 · MiMo-V2-Omni-0327
AA · May 1, 2026
2.08s
#120 · Hermes 3 - Llama-3.1 70B
AA · May 1, 2026
2.10s
#121 · Phi-4
AA · May 1, 2026
2.12s
#122 · Llama 3.1 Nemotron 70B
AA · May 1, 2026
2.19s
#123 · Qwen3 Next 80B A3B
AA · May 1, 2026
2.22s
#124 · Ring-flash-2.0
AA · May 1, 2026
2.23s
#125 · Qwen3 VL 30B A3B
AA · May 1, 2026
2.24s
#126 · Qwen3 VL 30B A3B
AA · May 1, 2026
2.25s
#127 · Qwen3 8B
AA · May 1, 2026
2.27s
#128 · GLM-5
AA · May 1, 2026
2.27s
#129 · GLM-4.5V
AA · May 1, 2026
2.28s
#130 · Qwen3.5 35B A3B
AA · May 1, 2026
2.30s
#131 · MiMo-V2-Flash
AA · May 1, 2026
2.31s
#132 · KAT-Coder-Pro V2
AA · May 1, 2026
2.32s
#133 · Reka Flash
AA · May 1, 2026
2.33s
#134 · Sarvam 105B (high)
AA · May 1, 2026
2.34s
#135 · Ling-flash-2.0
AA · May 1, 2026
2.36s
#136 · Qwen3.6 35B A3B
AA · May 1, 2026
2.36s
#137 · Qwen3 VL 8B
AA · May 1, 2026
2.37s
#138 · Hermes 4 405B
AA · May 1, 2026
2.43s
#139 · Llama 3.1 405B
AA · May 1, 2026
2.43s
#140 · Qwen3 30B
AA · May 1, 2026
2.43s
#141 · Qwen3 Next 80B A3B
AA · May 1, 2026
2.43s
#142 · Qwen3 VL 8B
AA · May 1, 2026
2.43s
#143 · Hermes 4 405B
AA · May 1, 2026
2.44s
#144 · Qwen3.6 35B A3B
AA · May 1, 2026
2.45s
#145 · Qwen3.5 397B A17B
AA · May 1, 2026
2.46s
#146 · Qwen3 4B
AA · May 1, 2026
2.47s
#147 · Qwen2.5 Turbo
AA · May 1, 2026
2.47s
#148 · Qwen3 30B A3B 2507
AA · May 1, 2026
2.48s
#149 · Qwen3 30B
AA · May 1, 2026
2.48s
#150 · Qwen3.5 122B A10B
AA · May 1, 2026
2.49s
#151 · Qwen3.5 Omni Plus
AA · May 1, 2026
2.49s
#152 · Qwen3 32B
AA · May 1, 2026
2.52s
#153 · Qwen3 235B 2507
AA · May 1, 2026
2.55s
#154 · Llama Nemotron Ultra
AA · May 1, 2026
2.58s
#155 · Gemma 3 12B
AA · May 1, 2026
2.63s
#156 · Qwen3 VL 32B
AA · May 1, 2026
2.63s
#157 · Qwen3 VL 32B
AA · May 1, 2026
2.63s
#158 · MiMo-V2-Flash (Feb 2026)
AA · May 1, 2026
2.65s
#159 · Qwen3 VL 235B A22B
AA · May 1, 2026
2.65s
#160 · MiMo-V2-Flash
AA · May 1, 2026
2.67s
#161 · Qwen3 Coder 30B A3B
AA · May 1, 2026
2.68s
#162 · GPT-4 Turbo
AA · May 1, 2026
2.75s
#163 · Qwen3 14B
AA · May 1, 2026
2.80s
#164 · Qwen3 235B A22B 2507
AA · May 1, 2026
2.81s
#165 · GLM-4.5-Air
AA · May 1, 2026
2.82s
#166 · Qwen3 235B
AA · May 1, 2026
2.82s
#167 · Qwen3.6 Plus
AA · May 1, 2026
2.83s
#168 · Kimi K2.6
AA · May 1, 2026
2.87s
#169 · GLM-4.6V
AA · May 1, 2026
2.90s
#170 · Qwen3 235B
AA · May 1, 2026
2.92s
#171 · Kimi K2.5
AA · May 1, 2026
2.94s
#172 · Qwen3-Coder 480B A35B
AA · May 1, 2026
2.95s
#173 · Kimi K2.6
AA · May 1, 2026
2.97s
#174 · Kimi K2 0905
AA · May 1, 2026
2.98s
#175 · Kimi K2.5
AA · May 1, 2026
2.98s
#176 · Qwen3 VL 235B A22B
AA · May 1, 2026
3.12s
#177 · MiMo-V2.5-Pro
AA · May 1, 2026
3.15s
#178 · Qwen2.5 Max
AA · May 1, 2026
3.19s
#179 · GLM-4.5
AA · May 1, 2026
3.22s
#180 · MiMo-V2.5-Pro
AA · May 1, 2026
3.28s
#181 · Nova Premier
AA · May 1, 2026
3.32s
#182 · Seed-OSS-36B-Instruct
AA · May 1, 2026
3.38s
#183 · Hy3-preview
AA · May 1, 2026
3.39s
#184 · GPT-4o mini
AA · May 1, 2026
3.45s
#185 · ERNIE 4.5 300B A47B
AA · May 1, 2026
3.46s
#186 · Qwen3.6 Max Preview
AA · May 1, 2026
3.46s
#187 · Qwen2.5 72B
AA · May 1, 2026
3.61s
#188 · Hy3-preview
AA · May 1, 2026
3.63s
#189 · MiMo-V2-Pro
AA · May 1, 2026
3.66s
#190 · Qwen3.6 27B
AA · May 1, 2026
3.80s
#191 · Qwen3.6 27B
AA · May 1, 2026
3.92s
#192 · Mercury 2
AA · May 1, 2026
4.00s
#193 · Qwen3 Max
AA · May 1, 2026
4.13s
#194 · Qwen3 Max Thinking
AA · May 1, 2026
4.24s
#195 · Gemini 3.1 Flash-Lite Preview
AA · May 1, 2026
5.46s
#196 · Qwen3.5 27B
AA · May 1, 2026
5.77s
#197 · Grok 4.1 Fast
AA · May 1, 2026
7.23s
#198 · Gemini 3 Flash
AA · May 1, 2026
7.51s
#199 · Nova 2.0 Pro Preview (low)
AA · May 1, 2026
7.81s
#200 · Nova 2.0 Lite (low)
AA · May 1, 2026
7.88s
#201 · Grok Code Fast
AA · May 1, 2026
7.91s
#202 · LongCat Flash Lite
AA · May 1, 2026
7.97s
#203 · Grok 4 Fast
AA · May 1, 2026
8.02s
#204 · Granite 4.0 H Small
AA · May 1, 2026
10.21s
#205 · Claude 4.5 Sonnet
AA · May 1, 2026
10.30s
#206 · Claude 4 Opus
AA · May 1, 2026
11.28s
#207 · GLM-4.6V
AA · May 1, 2026
11.30s
#208 · o3
AA · May 1, 2026
11.96s
#209 · Nova 2.0 Lite (medium)
AA · May 1, 2026
12.89s
#210 · Grok 4.3
AA · May 1, 2026
13.44s
#211 · Claude 4 Sonnet
AA · May 1, 2026
13.54s
#212 · Claude 4.1 Opus
AA · May 1, 2026
14.02s
#213 · Claude 4.5 Haiku
AA · May 1, 2026
14.06s
#214 · Nova 2.0 Pro Preview (medium)
AA · May 1, 2026
14.15s
#215 · Nova 2.0 Lite (high)
AA · May 1, 2026
14.62s
#216 · Claude Opus 4.5
AA · May 1, 2026
15.64s
#217 · Grok 4
AA · May 1, 2026
15.69s
#218 · Gemini 2.5 Flash
AA · May 1, 2026
16.45s
#219 · Claude Opus 4.7
AA · May 1, 2026
20.83s
#220 · Gemini 2.5 Flash-Lite
AA · May 1, 2026
20.87s
#221 · Grok 4.20
AA · May 1, 2026
21.15s
#222 · Gemini 2.5 Pro
AA · May 1, 2026
24.44s
#223 · Gemini 3.1 Pro Preview
AA · May 1, 2026
24.52s
#224 · o4 mini
AA · May 1, 2026
24.61s
#225 · Claude Opus 4.6
AA · May 1, 2026
24.72s
#226 · Granite 3.3 8B
AA · May 1, 2026
26.03s
#227 · o3 mini
AA · May 1, 2026
28.32s
#228 · Gemini 3 Pro Preview
AA · May 1, 2026
28.47s
#229 · o1
AA · May 1, 2026
28.91s
#230 · GLM-4.5V
AA · May 1, 2026
32.72s
#231 · GPT-5.1
AA · May 1, 2026
35.74s
#232 · GPT-5
AA · May 1, 2026
88.33s
#233 · GPT-5.3 Codex
AA · May 1, 2026
92.14s
#234 · GPT-5.4 nano
AA · May 1, 2026
102.74s
#235 · o3-pro
AA · May 1, 2026
105.02s
#236 · Claude Sonnet 4.6
AA · May 1, 2026
105.91s
#237 · GPT-5.5
AA · May 1, 2026
111.36s
#238 · GPT-5.4 mini
AA · May 1, 2026
118.10s
#239 · GPT-5.2
AA · May 1, 2026
151.35s
#240 · GPT-5.4
AA · May 1, 2026
188.47s