Driver metadata
- Slug: xlm-r
- Status: retracted. Original XLM-R WRONG verdict retracted 2026-05-13. Driver disabled.
- Protocol match: none. Protocol-match tier not declared on the driver.
- Agent version: v0.1.0-xlm-r-xnli-microslice
- Paper: Unsupervised Cross-lingual Representation Learning at Scale
- Model: joeddav/xlm-roberta-large-xnli
Source
Every driver is a TypeScript orchestration script that routes the result through the Verdict Validator. Drivers with runnable public checkpoints also link the hermetic Modal job that loads the model and measures the paper claim.
- scripts/run-reproduction-xlm-r.ts: TypeScript orchestrator (claim citation, DB write, validator wiring)
- modal/reproduce_xlm_r.py: Python Modal job (HF model load, evaluation harness, measurement)
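
The split described above (a Python job measures, a TypeScript script cites the claim and routes the result through a validator) might look roughly like the sketch below. It is purely illustrative: the type names, the `validateVerdict` function, the verdict labels other than WRONG, the tolerance, and the minimum sample size are all assumptions made for this example and are not taken from the actual driver scripts.

```typescript
// Hypothetical sketch only: none of these names come from the real driver code.
type Verdict = "REPRODUCED" | "WRONG" | "INCONCLUSIVE";

interface Claim {
  paper: string;        // paper title the claim is cited from
  metric: string;       // e.g. "accuracy"
  claimedValue: number; // value reported in the paper
}

interface Measurement {
  model: string;         // checkpoint the measurement job loaded
  metric: string;        // metric the job actually computed
  measuredValue: number; // value the job observed
  sampleSize: number;    // number of evaluation examples used
}

interface DriverResult {
  slug: string;
  verdict: Verdict;
  reason: string;
}

// Assumed validator logic: refuse to issue a hard verdict when the metrics
// differ or the evaluation slice is too small, otherwise compare within a
// tolerance. The default thresholds are placeholders, not real settings.
function validateVerdict(
  slug: string,
  claim: Claim,
  m: Measurement,
  tolerance: number = 0.02,
  minSamples: number = 1000,
): DriverResult {
  if (claim.metric !== m.metric) {
    return { slug, verdict: "INCONCLUSIVE", reason: "metric mismatch" };
  }
  if (m.sampleSize < minSamples) {
    return { slug, verdict: "INCONCLUSIVE", reason: "sample too small" };
  }
  const gap = Math.abs(claim.claimedValue - m.measuredValue);
  return gap <= tolerance
    ? { slug, verdict: "REPRODUCED", reason: `gap ${gap.toFixed(3)} within tolerance` }
    : { slug, verdict: "WRONG", reason: `gap ${gap.toFixed(3)} exceeds tolerance` };
}

// Usage with made-up placeholder numbers (not results from any real run).
const example = validateVerdict(
  "xlm-r",
  { paper: "Unsupervised Cross-lingual Representation Learning at Scale", metric: "accuracy", claimedValue: 0.8 },
  { model: "joeddav/xlm-roberta-large-xnli", metric: "accuracy", measuredValue: 0.7, sampleSize: 50 },
);
console.log(example.verdict); // INCONCLUSIVE (the slice is below minSamples)
```

Under these assumptions the validator returns a hard verdict only when the measurement is comparable to the claim, and falls back to INCONCLUSIVE otherwise.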