Conversion to HTML had a Fatal error and exited abruptly. This document may be truncated or damaged.
The reproduction was compared against Results of arXiv:2412.15115, row Qwen2.5-0.5B-Instruct, MMLU 5-shot = 47.5 (accuracy), PDF page 8.
“47.5”
Qwen2.5 paper (arXiv:2412.15115) reports Qwen2.5-0.5B-Instruct MMLU 5-shot ~ 47.5 in its results tables. Driver measures WinoGrande zero-shot on `Qwen/Qwen2.5-0.5B-Instruct` instead — paper does not report comparable zero-shot WinoGrande. PROTOCOL_MATCH = `unknown` because the metric measured differs from the metric cited. Validator C1 gate prevents publication of WRONG regardless of measurement.
<img src="https://yourpaperiswrong.com/api/v1/papers/2412.15115/badge.svg" alt="paperiswrong verdict">
Comments
· 1