Recent verdicts

Your feed.

POST verdicts published across the corpus, grouped by day. Sign in to follow venues, labs, and authors and have this list narrow to what you care about.

50 verdicts on this page · Wall of Wrong (full timeline)

Earlier (50)

PARTIAL
2 OLMo 2 Furious
· arXiv 2024 · cs.CL
REPRODUCED
RoBERTa: A Robustly Optimized BERT Pretraining Approach
· arXiv preprint · cs.CL
REPRODUCED
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
· COLM 2024 · cs.LG
REPRODUCED
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
· NeurIPS 2019 EMC^2 Workshop · cs.CL
REPRODUCED
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
· NAACL 2019 · cs.CL
PARTIAL
Stable LM 2 1.6B Technical Report
· arXiv 2024 · cs.CL
PARTIAL
OLMoE: Open Mixture-of-Experts Language Models
· arXiv 2024 · cs.CL
PARTIAL
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
· arXiv 2025 · cs.CL
PARTIAL
SmolLM2: When Smol Goes Big — Data-Centric Training of a Small Language Model
· arXiv 2025 · cs.CL
PARTIAL
Qwen2.5 Technical Report
· arXiv 2024 · cs.CL
REPRODUCED
Yi: Open Foundation Models by 01.AI
· arXiv 2024 · cs.CL
PARTIAL
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
· arXiv 2024 · cs.CL
PARTIAL
Searching for MobileNetV3
· ICCV 2019 · cs.CV
REPRODUCED
Mistral 7B
· arXiv 2023 · cs.CL
PARTIAL
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
· COLM 2024 · cs.CL
PARTIAL
Gemma: Open Models Based on Gemini Research and Technology
· arXiv 2024 · cs.CL
PARTIAL
TinyLlama: An Open-Source Small Language Model
· arXiv 2024 · cs.CL
REPRODUCED
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing
· ICLR 2023 · cs.CL
REPRODUCED
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
· arXiv 2022 · cs.CL
PARTIAL
Code Llama: Open Foundation Models for Code
· arXiv 2023 · cs.CL
PARTIAL
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
· arXiv 2024 · cs.SE
PARTIAL
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
· ICML 2023 · cs.CL
REPRODUCED
OPT: Open Pre-trained Transformer Language Models
· arXiv 2022 · cs.CL
PARTIAL
Swin Transformer V2: Scaling Up Capacity and Resolution
· CVPR 2022 · cs.CV
REPRODUCED
XLNet: Generalized Autoregressive Pretraining for Language Understanding
· NeurIPS 2019 · cs.CL
REPRODUCED
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
· EMNLP 2019 · cs.CL
PARTIAL
StarCoder: may the source be with you!
· arXiv 2023 · cs.CL
REPRODUCED
LoRA: Low-Rank Adaptation of Large Language Models
· ICLR 2022 · cs.CL
PARTIAL
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
· ICML 2021 · cs.CV
REPRODUCED
Qwen2 Technical Report
· arXiv 2024 · cs.CL
PARTIAL
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
· ICML 2019 · cs.LG
PARTIAL
The Falcon Series of Open Language Models
· arXiv 2023 · cs.CL
PARTIAL
Textbooks Are All You Need II: phi-1.5 technical report
· arXiv 2023 · cs.CL
PARTIAL
Big Bird: Transformers for Longer Sequences
· NeurIPS 2020 · cs.LG
PARTIAL
Scaling Instruction-Finetuned Language Models
· arXiv 2022 · cs.LG
PARTIAL
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
· ICML 2023 · cs.CV
REPRODUCED
DINOv2: Learning Robust Visual Features without Supervision
· TMLR 2024 · cs.CV
REPRODUCED
Emerging Properties in Self-Supervised Vision Transformers
· ICCV 2021 · cs.CV
PENDING
A ConvNet for the 2020s
· CVPR 2022 · cs.CV
PARTIAL
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
· ACL 2020 · cs.CL
REPRODUCED
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
· ICLR 2021 · cs.CL
REPRODUCED
Learning Transferable Visual Models From Natural Language Supervision
· ICML 2021 · cs.CV
REPRODUCED
Robust Speech Recognition via Large-Scale Weak Supervision
· arXiv preprint (Whisper) · cs.CL
REPRODUCED
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
· ICLR 2021 · cs.CV
PARTIAL
Deep Residual Learning for Image Recognition
· CVPR 2016 · cs.CV
REPRODUCED
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
· ICLR 2020 · cs.CL
REPRODUCED
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
· JMLR 2020 · cs.LG
REPRODUCED
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
· ICLR 2020 · cs.CL
REPRODUCED
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
· ACL 2020 · cs.CL
PENDING
Llama 2: Open Foundation and Fine-Tuned Chat Models
· arXiv preprint · cs.CL
“WRONG” is a technical term: it means a headline numerical claim did not reproduce, within the published tolerance, in our reproduction job. Authors have a prominent right of reply.