- [41] Ouroboros: On Accelerating Training of Transformer-Based Language Models. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [42] Transformer Language Models Handle Word Frequency in Prediction Head. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023: 4523-4535
- [43] A Comparison of Transformer-Based Language Models on NLP Benchmarks. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2022), 2022, 13286: 490-501
- [44] Pushdown Layers: Encoding Recursive Structure in Transformer Language Models. 2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023: 3233-3247
- [46] TAG: Gradient Attack on Transformer-based Language Models. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021: 3600-3610
- [47] The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction. INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, 2019: 127-133
- [48] Probabilistic generative transformer language models for generative design of molecules. Journal of Cheminformatics, 15
- [50] Ecco: An Open Source Library for the Explainability of Transformer Language Models. ACL-IJCNLP 2021: THE JOINT CONFERENCE OF THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021: 249-257