Retrospective validation runs

Queue and review internal benchmark checks after the research workflow is complete. This page does not produce client-facing claims.
6 runs · Mean Δ +9.0 pts · 1 in flight

Validation queue

New comparison run

Comparison

Lung adenocarcinoma

Baseline vs prototype relevance

Runs

Benchmark run log

6 results
ID | Label | Started | Status | Classical | Prototype | Δ | Reviewer
bench-0042 | Top-10 ranking agreement · LUAD retro | 4h ago | Complete | 62 | 71 | +9 | Dr. Tan Wei Lin
bench-0041 | Pathway mapping consistency · BRCA retro | 1d ago | Complete | 70 | 78 | +8 | Dr. Aiman Rashid
bench-0040 | Evidence-category agreement · CRC retro | 3d ago | In review | 65 | 74 | +9 | Prof. Lim Boon Eng
bench-0039 | Top-k precision · LUAD retro | 6d ago | Complete | 58 | 69 | +11 | Dr. Tan Wei Lin
bench-0038 | Run-to-run stability · CRC retro | 8d ago | Complete | 84 | 92 | +8 | Prof. Lim Boon Eng
bench-0037 | Prototype layer noise sweep | 42m ago | Queued | | | |
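
The summary figure "Mean Δ +9.0 pts" can be reproduced from the run log. The sketch below assumes that Δ is the prototype score minus the classical score and that the mean covers only the five runs that already have scores; the page does not state either rule, though both match the numbers shown. The names `scored_runs` and `delta` are hypothetical and exist only for this illustration.

```python
from statistics import mean

# (run ID, classical score, prototype score), copied from the run log.
# bench-0037 is queued and has no scores yet, so it is left out (assumption).
scored_runs = [
    ("bench-0042", 62, 71),
    ("bench-0041", 70, 78),
    ("bench-0040", 65, 74),
    ("bench-0039", 58, 69),
    ("bench-0038", 84, 92),
]


def delta(classical: int, prototype: int) -> int:
    """Assumed definition of Δ: prototype score minus classical score."""
    return prototype - classical


# Per-run Δ values keyed by run ID.
deltas = {run_id: delta(c, p) for run_id, c, p in scored_runs}

# Unweighted mean across the scored runs.
mean_delta = mean(deltas.values())

print(f"Mean Δ {mean_delta:+.1f} pts")  # prints "Mean Δ +9.0 pts"
```

The per-run values come out as +9, +8, +9, +11 and +8, which agree with the Δ column of the log.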