Driver

flan-t5

Reproduction driver for Scaling Instruction-Finetuned Language Models. Source file: scripts/run-reproduction-flan-t5.ts.

Driver metadata

Slug
flan-t5
Status
active
Protocol match
proxyDriver measures a proxy of the cited metric. Validator C1 auto-downgrades not_reproduced to partial.
Agent version
v0.1.0-flan-t5-mmlu-microslice
Paper
Scaling Instruction-Finetuned Language Models
Model
google/flan-t5-large

Claim citation

This is the exact paper claim the driver compares against before the Verdict Validator allows any public WRONG path.

Location
Table 6 · FLAN-T5-Large
Metric
MMLU 0-shot direct(accuracy)
Reported value
45.1
PDF page
14
Quoted text
FLAN-T5-Large 45.1

Source

Every driver is a TypeScript orchestration script that routes the result through the Verdict Validator. Drivers with runnable public checkpoints also link the hermetic Modal job that loads the model and measures the paper claim.