Recent verdicts

Your feed.

POST verdicts published across the corpus, grouped by day. Sign in to follow venues, labs, and authors and have this list narrow to what you care about.

Personalize this feed

Follow venues, labs, and authors to narrow this list to your reading list.

Sign in to follow Browse leaderboards →

50 verdicts on this pageWall of Wrong (full timeline) →

Earlier(50)

2 OLMo 2 Furious

· arXiv 2024 · cs.CL

reported → reproduced— → pending

RoBERTa: A Robustly Optimized BERT Pretraining Approach

· arXiv preprint · cs.CL

reported → reproduced— → pending

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

· COLM 2024 · cs.LG

reported → reproduced— → pending

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

· NeurIPS 2019 EMC^2 Workshop · cs.CL

reported → reproduced— → pending

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

· NAACL 2019 · cs.CL

reported → reproduced— → pending

Stable LM 2 1.6B Technical Report

· arXiv 2024 · cs.CL

reported → reproduced— → pending

OLMoE: Open Mixture-of-Experts Language Models

· arXiv 2024 · cs.CL

reported → reproduced— → pending

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

· arXiv 2025 · cs.CL

reported → reproduced— → pending

SmolLM2: When Smol Goes Big — Data-Centric Training of a Small Language Model

· arXiv 2025 · cs.CL

reported → reproduced— → pending

Qwen2.5 Technical Report

· arXiv 2024 · cs.CL

reported → reproduced— → pending

Yi: Open Foundation Models by 01.AI

· arXiv 2024 · cs.CL

reported → reproduced— → pending

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

· arXiv 2024 · cs.CL

reported → reproduced— → pending

Searching for MobileNetV3

· ICCV 2019 · cs.CV

reported → reproduced— → pending

· arXiv 2023 · cs.CL

reported → reproduced— → pending

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

· COLM 2024 · cs.CL

reported → reproduced— → pending

Gemma: Open Models Based on Gemini Research and Technology

· arXiv 2024 · cs.CL

reported → reproduced— → pending

TinyLlama: An Open-Source Small Language Model

· arXiv 2024 · cs.CL

reported → reproduced— → pending

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

· ICLR 2023 · cs.CL

reported → reproduced— → pending

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

· arXiv 2022 · cs.CL

reported → reproduced— → pending

Code Llama: Open Foundation Models for Code

· arXiv 2023 · cs.CL

reported → reproduced— → pending

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

· arXiv 2024 · cs.SE

reported → reproduced— → pending

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

· ICML 2023 · cs.CL

reported → reproduced— → pending

OPT: Open Pre-trained Transformer Language Models

· arXiv 2022 · cs.CL

reported → reproduced— → pending

Swin Transformer V2: Scaling Up Capacity and Resolution

· CVPR 2022 · cs.CV

reported → reproduced— → pending

XLNet: Generalized Autoregressive Pretraining for Language Understanding

· NeurIPS 2019 · cs.CL

reported → reproduced— → pending

Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

· EMNLP 2019 · cs.CL

reported → reproduced— → pending

StarCoder: may the source be with you!

· arXiv 2023 · cs.CL

reported → reproduced— → pending

LoRA: Low-Rank Adaptation of Large Language Models

· ICLR 2022 · cs.CL

reported → reproduced— → pending

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

· ICML 2021 · cs.CV

reported → reproduced— → pending

Qwen2 Technical Report

· arXiv 2024 · cs.CL

reported → reproduced— → pending

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

· ICML 2019 · cs.LG

reported → reproduced— → pending

The Falcon Series of Open Language Models

· arXiv 2023 · cs.CL

reported → reproduced— → pending

Textbooks Are All You Need II: phi-1.5 technical report

· arXiv 2023 · cs.CL

reported → reproduced— → pending

Big Bird: Transformers for Longer Sequences

· NeurIPS 2020 · cs.LG

reported → reproduced— → pending

Scaling Instruction-Finetuned Language Models

· arXiv 2022 · cs.LG

reported → reproduced— → pending

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

· ICML 2023 · cs.CV

reported → reproduced— → pending

DINOv2: Learning Robust Visual Features without Supervision

· TMLR 2024 · cs.CV

reported → reproduced— → pending

Emerging Properties in Self-Supervised Vision Transformers

· ICCV 2021 · cs.CV

reported → reproduced— → pending

A ConvNet for the 2020s

· CVPR 2022 · cs.CV

reported → reproduced— → pending

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

· ACL 2020 · cs.CL

reported → reproduced— → pending

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

· ICLR 2021 · cs.CL

reported → reproduced— → pending

Learning Transferable Visual Models From Natural Language Supervision

· ICML 2021 · cs.CV

reported → reproduced— → pending

Robust Speech Recognition via Large-Scale Weak Supervision

· arXiv preprint (Whisper) · cs.CL

reported → reproduced— → pending

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

· ICLR 2021 · cs.CV

reported → reproduced— → pending

Deep Residual Learning for Image Recognition

· CVPR 2016 · cs.CV

reported → reproduced— → pending

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

· ICLR 2020 · cs.CL

reported → reproduced— → pending

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

· JMLR 2020 · cs.LG

reported → reproduced— → pending

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

· ICLR 2020 · cs.CL

reported → reproduced— → pending

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices

· ACL 2020 · cs.CL

reported → reproduced— → pending

Llama 2: Open Foundation and Fine-Tuned Chat Models

· arXiv preprint · cs.CL

reported → reproduced— → pending

“WRONG” is a technical term — it means a headline numerical claim did not reproduce on our reproduction job, within published tolerance. Authors have right of reply, prominently. Definition →