Benchmark platform
BridgeBench
Coding-heavy evaluation focused on UI generation, debugging, refactoring, security, hallucination, and speed.
Verification status: verified (last checked May 13, 2026)
Evidence ledger
Modalities: code
Cadence: continuous
API: not public
Evaluations: 122
Verification: verified
Verified runtime: 122
Manual verified: 0
Relay / mirrored: 0
Backfilled: 0
Relay sources mirror another provider's public page; manual rows are checked against the cited page; backfilled rows are historical inserts; seeded rows are demo fixtures. Relay rows are supporting evidence, not first-party measurements.
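The ledger counts above can be reproduced with a simple tally over row provenance. The following is a minimal sketch, not BridgeBench's actual schema: the `provenance` field, the category labels, and the `tally_ledger` helper are all hypothetical names chosen to mirror the categories described above.

```python
from collections import Counter

# Hypothetical labels mirroring the ledger categories; not a BridgeBench schema.
CATEGORIES = ("verified_runtime", "manual_verified", "relay", "backfilled", "seeded")


def tally_ledger(rows):
    """Count rows per provenance category and report relay rows separately."""
    counts = Counter(row["provenance"] for row in rows)
    unknown = set(counts) - set(CATEGORIES)
    if unknown:
        raise ValueError(f"unknown provenance labels: {sorted(unknown)}")
    tally = {name: counts.get(name, 0) for name in CATEGORIES}
    tally["total"] = sum(counts.values())
    # Relay rows are supporting evidence, so they are surfaced on their own.
    tally["supporting_only"] = tally["relay"]
    return tally


# Example input shaped like the ledger above: 122 rows, all verified at runtime.
rows = [{"provenance": "verified_runtime"} for _ in range(122)]
summary = tally_ledger(rows)
print(summary)
```

Rejecting unknown labels rather than silently counting them keeps a mislabeled row from inflating the total.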
Operational state
snapshot: Latest pull (json, May 13, 2026)
parser: Loaded 122 live BridgeBench benchmark records. (ok, 0.2.0)
verify: bridgebench verification finished with status verified. (verified, May 13, 2026)
Benchmarks from this source
Debugging: Debugging (metric: Score)
Security: Security review (metric: Score)
BS pushback: Premise rejection and hallucination resistance (metric: Pushback rate)
Speed throughput: Code-generation throughput (metric: Throughput)
Speed TTFT: Initial latency (metric: TTFT, time to first token)
Latest change explanation
bridgebench matched bridgebench-20260513T010647Z with no notable change causes detected.