Visible tradeoffs
This is a rubric-judged signal, so it is more structured than arena taste but still depends on the scoring rubric.
source: Scale Labs
metric: Honesty score (%)
judge: Rubric
direction: higher better
group id: scale_mask_current
domain: Safety
What it measures vs what it misses
✓ Measures
Whether a model stays honest instead of covertly optimizing against the user.
✗ Misses
General capability breadth. Tool-use or retrieval quality.
Why this counts
Whether a model stays honest instead of covertly optimizing against the user.
Comparable-group rule
This percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.
What it misses
General capability breadth.
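The comparable-group rule can be illustrated with a minimal Python sketch. The page does not state how the percentile is computed, so the formula here (share of models in the same group scoring at or below a given model) is an assumption, as are the function name `group_percentiles`, the model names, the scores, and the second group id `other_group`. Only `scale_mask_current` comes from the card above.

```python
from collections import defaultdict


def group_percentiles(rows):
    """Rank each model only against models sharing its group id.

    rows: iterable of (model, group_id, score), where higher is better.
    Returns {(model, group_id): percentile in the range (0, 100]}.
    Assumed definition: percent of group members scoring at or below the model.
    """
    by_group = defaultdict(list)
    for model, group_id, score in rows:
        by_group[group_id].append((model, score))

    result = {}
    for group_id, entries in by_group.items():
        scores = [score for _, score in entries]
        for model, score in entries:
            at_or_below = sum(1 for s in scores if s <= score)
            result[(model, group_id)] = 100.0 * at_or_below / len(scores)
    return result


# Hypothetical data: none of these models or scores come from the source.
rows = [
    ("model_a", "scale_mask_current", 90.0),
    ("model_b", "scale_mask_current", 80.0),
    ("model_c", "scale_mask_current", 70.0),
    ("model_a", "other_group", 10.0),
    ("model_d", "other_group", 50.0),
]
percentiles = group_percentiles(rows)
```

In this made-up data, `model_a` gets a different percentile in each group, which is why a percentile from one benchmark/version group should not be read as a universal score.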