Papers (2)
- DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
arXiv:1910.01108 · NeurIPS 2019 EMC^2 Workshop
POSTREPRODUCEDPREpending - BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
arXiv:2211.05100 · arXiv 2022
POSTREPRODUCEDPREpending