High efficient training method of MiniGo on large-scale heterogeneous computing platform

被引:0
|
作者
Li, Rongchun [1 ]
He, Zhouyu [1 ]
Qiao, Peng [1 ]
Jiang, Jingfei [1 ]
Dou, Yong [1 ]
Li, Dongsheng [1 ]
机构
[1] National Key Laboratory of Parallel and Distributed Computing, National University of Defense Technology, Changsha,410073, China
关键词
An efficient multi-level parallel training method suitable for training MiniGo agents on large-scale heterogeneous computing platforms was proposed; including task level parallelism between nodes; CPU-DSP (central processing unit-digital signal process) heterogeneous parallelism and DSP core parallelism. Efficient input/output deployment and eliminated the bottleneck of network communication were realized. A heterogeneous computing memory management oriented to CPU-DSP shared memory structure was proposed to reduce the data handling between heterogeneous devices. Shared memory programming optimization was realized; and the dense convolution calculation operator acceleration optimization was realized by DSP. Results show that compared with 16 core CPU calculation; the maximum acceleration ratio of single core DSP operator acceleration is 16. 44. In this method; the scale of computing nodes is expanded from 1 067 to 4 139; the time required to reach the given termination condition is reduced from 43. 02 h to 16. 05 h; and the expansion efficiency is 69. 1%. Evaluation shows that this method can realize the efficient parallel training of MiniGo on large-scale heterogeneous computing platforms. © 2024 National University of Defense Technology. All rights reserved;
D O I
10.11887/j.cn.202405022
中图分类号
学科分类号
摘要
引用
收藏
页码:209 / 218
相关论文
共 50 条
  • [31] An Efficient Grouping Method for Large-Scale MBIST
    Yang, Rongjie
    Wang, Zheng
    Shen, Minghua
    2024 INTERNATIONAL SYMPOSIUM OF ELECTRONICS DESIGN AUTOMATION, ISEDA 2024, 2024, : 486 - 491
  • [32] An Efficient Method for Large-Scale Gate Sizing
    Joshi, Siddharth
    Boyd, Stephen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2008, 55 (09) : 2760 - 2773
  • [33] Performance Analysis of the FCBSH Algorithm for Large-Scale Heterogeneous Computing Environments
    Du, Xiaoli
    Jiang, Changjun
    Yin, Fei
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2009, 3 (04) : 461 - 476
  • [34] Editorial Note: Large-scale Heterogeneous Multimedia Data Computing and Understanding
    Gao, Zan
    Zhang, Hanwang
    Wang, Charles
    Yang, Yi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22033 - 22033
  • [35] Arbor: Efficient Large-Scale Graph Data Computing Model
    Zhou, Wei
    Li, Bo
    Han, Jizhong
    Xu, Zhiyong
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 300 - 307
  • [36] Optasia: A Relational Platform for Efficient Large-Scale Video Analytics
    Lu, Yao
    Chowdhery, Aakanksha
    Kandula, Srikanth
    PROCEEDINGS OF THE SEVENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC 2016), 2016, : 57 - 70
  • [37] Challenges and Opportunities to Enable Large-Scale Computing via Heterogeneous Chiplets
    Yang, Zhuoping
    Ji, Shixin
    Chen, Xingzhen
    Zhuang, Jinming
    Zhang, Weifeng
    Jani, Dharmesh
    Zhou, Peipei
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 765 - 770
  • [38] An efficient classification approach for large-scale mobile ubiquitous computing
    Tang, Feilong
    You, Ilsun
    Tang, Can
    Guo, Minyi
    INFORMATION SCIENCES, 2013, 232 : 419 - 436
  • [39] Editorial Note: Large-scale Heterogeneous Multimedia Data Computing and Understanding
    Multimedia Tools and Applications, 2018, 77 : 22033 - 22033
  • [40] Tuning Heterogeneous Computing Platforms for Large-Scale Hydrology Data Management
    Leonard, Lorne
    Madduri, Kamesh
    Duffy, Christopher J.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (09) : 2753 - 2765