Papers (4)
- RoBERTa: A Robustly Optimized BERT Pretraining Approach
arXiv:1907.11692 · arXiv preprint
POST: REPRODUCED · PRE: pending
- A ConvNet for the 2020s
arXiv:2201.03545 · CVPR 2022
POST: PENDING · PRE: pending
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
arXiv:1910.13461 · ACL 2020
POST: PARTIAL · PRE: pending
- Llama 2: Open Foundation and Fine-Tuned Chat Models
arXiv:2307.09288 · arXiv preprint
POST: PENDING · PRE: pending