共 50 条
- [2] Enabling Efficient Large-Scale Deep Learning Training with Cache Coherent Disaggregated Memory Systems 2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 126 - 140
- [3] GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, ASPLOS 2024, VOL 2, 2024, : 450 - 466
- [4] Efficient Large-scale Deep Learning Framework for Heterogeneous Multi-GPU Cluster 2019 IEEE 4TH INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W 2019), 2019, : 176 - 181
- [5] Efficient MPI-AllReduce for large-scale deep learning on GPU-clusters CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (12):
- [7] Training large-scale language models with limited GPU memory: a survey FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2025, : 309 - 331
- [8] Resource-efficient Federated Learning for Large-scale Model Training PROCEEDINGS OF THE WORKSHOP ON MOBILITY IN THE EVOLVING INTERNET ARCHITECTURE TO BE HELD IN CONJUNCTION WITH MOBICOM 2024, MOBIARCH 2024, 2024, : 43 - 48
- [9] Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR IEEE ACCESS, 2019, 7 : 133615 - 133627