50 entries in total
- [41] Generative Inference of Large Language Models in Edge Computing: An Energy Efficient Approach. 20th International Wireless Communications & Mobile Computing Conference, IWCMC 2024, 2024: 244-249
- [42] Tabi: An Efficient Multi-Level Inference System for Large Language Models. Proceedings of the Eighteenth European Conference on Computer Systems, EuroSys 2023, 2023: 233-248
- [44] An efficient quantized GEMV implementation for large language models inference with matrix core. Journal of Supercomputing, 2025, 81(3)
- [45] Distributed Inference and Fine-tuning of Large Language Models Over The Internet. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [46] Assessing Large Language Models for Oncology Data Inference From Radiology Reports. JCO Clinical Cancer Informatics, 2024, 8
- [47] Context-Aware Abbreviation Expansion Using Large Language Models. NAACL 2022: The 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022: 1261-1275
- [48] Are Emergent Abilities in Large Language Models just In-Context Learning? Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, Vol 1: Long Papers, 2024: 5098-5139
- [49] Towards a benchmark dataset for large language models in the context of process automation. Digital Chemical Engineering, 2024, 13
- [50] Visual In-Context Learning for Large Vision-Language Models. Findings of the Association for Computational Linguistics: ACL 2024, 2024: 15890-15902