paperiswrong

Counters

Reproductions run

186

every POST verdict in the database

Papers indexed

21k

arXiv ingest

Verdicts (24h)

POST verdicts with computed_at > now − 24h

Reproduce rate

—

(REPRODUCED + 0.5·PARTIAL) / total current decisive POST

The headline trust signal isn't the reproduce rate. It's how the platform handles its own errors.

8 retractions on file — read the full log (every retracted verdict preserved with date, reason, and audit thread). The 2026-05-13 rollup retraction of all seven then-public WRONG verdicts is the credibility test the platform actively chose to take.
Every decisive public result requires a row-bound claim receipt; unvalidated driver output remains pending. See the audit thread at /legal/retractions/2026-05-13 for what happens without one.
The headline verdict label WRONGis a technical term: “our reproduction did not match this paper's reported numbers.” See /methodology/wrong.

Every number above is reachable through the public REST API:

GET /api/v1/verdicts?limit=100 — paginated verdict listing with claim_citation + protocol_match on every row.
GET /api/v1/papers/<arxivId> — per-paper detail including the most-recent POST + PRE verdicts.
GET /api/v1/health — health-check.