Driver metadata
- Slug
- olmo
- Status
- retractedOriginal OLMo WRONG verdict retracted 2026-05-13 (LAMBADA fabrication). Driver is now a `not_attempted` stub.
- Protocol match
- —Protocol-match tier not declared on the driver.
- Agent version
- v0.1.1-olmo-not-attempted-stub
- Paper
- OLMo: Accelerating the Science of Language Models
- Model
- allenai/OLMo-1B-hf
Source
Every driver is a TypeScript orchestration script that routes the result through the Verdict Validator. Drivers with runnable public checkpoints also link the hermetic Modal job that loads the model and measures the paper claim.
- scripts/run-reproduction-olmo.ts ↗— TS orchestrator (claim citation, DB write, validator wiring)
- modal/reproduce_olmo.py ↗— Python Modal job (HF model load, evaluation harness, measurement)