Benchmarks · /benchmarks/mteb-retrieval-en-v2

Retrieval

MTEB retrieval slice for embeddings.

Source · MTEB
Version · mteb snapshot 2026-05-01
Scores · 1

Passport

Visible tradeoffsThis is a retrieval signal, so it is best read as search-stack quality rather than broad model capability.

source

MTEB

metric

NDCG@10 (ndcg)

judge

Retrieval

direction

higher better

group id

mteb_retrieval_en_v2

domain

Embeddings / retrieval

What it measures vs what it misses

✓ Measures

Embedding quality for retrieval tasks.

✗ Misses

Chat quality, generation, latency.

Why this countsIt is one of the few direct signals for retrieval stacks, where embedding quality matters more than chat style.Comparable-group ruleThis percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.What it missesIt does not tell you whether the same model is strong at generation, ranking policy, or final answer quality.

Leaderboard · this benchmark version

#1 · BAAI bge-large-en-v1.5

MTEB · May 1, 2026

49.3 ndcg