Papers (2)
- XLNet: Generalized Autoregressive Pretraining for Language Understanding
arXiv:1906.08237 · NeurIPS 2019
POSTREPRODUCEDPREpending - ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
arXiv:2003.10555 · ICLR 2020
POSTREPRODUCEDPREpending