Driver metadata
- Slug
- distilbart
- Status
- active
- Protocol match
- unknownDriver measures a metric the paper does not directly report. Validator C1 auto-downgrades not_reproduced to pending.
- Agent version
- v0.1.0-distilbart-cnndm-200slice
- Paper
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
- Model
- sshleifer/distilbart-cnn-12-6
Claim citation
This is the exact paper claim the driver compares against before the Verdict Validator allows any public WRONG path.
- Location
- Table 3 · BART
- Metric
- R-L(rougeLsum)
- Reported value
- 40.9
- PDF page
- 7
- Quoted text
- “BART 44.16 21.28 40.90”
Source
Every driver is a TypeScript orchestration script that routes the result through the Verdict Validator. Drivers with runnable public checkpoints also link the hermetic Modal job that loads the model and measures the paper claim.
- scripts/run-reproduction-distilbart.ts ↗— TS orchestrator (claim citation, DB write, validator wiring)
- modal/reproduce_distilbart.py ↗— Python Modal job (HF model load, evaluation harness, measurement)