A Static Scheduling Scheme of Multicore Compiler for Loop Load Imbalance in OpenMP

Cited by: 0
Authors
Chen, Benbin [1]
Guo, Donghui [1, 2]
Affiliations
[1] Xiamen Univ, Dept Elect Engn, Xiamen 361005, Fujian, Peoples R China
[2] Xiamen Univ, IC Design & IT Res Ctr Fujian Prov, Xiamen 361005, Fujian, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Static scheduling; Compiler; Loop load balance; OpenMP;
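DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract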
This paper presents a static scheduling scheme for OpenMP that addresses loop load imbalance in load-decreasing loops. Loop load balance is key to improving the speedup of a parallel program. Under the traditional default static scheduling of the OpenMP specification, loop iterations are divided into chunks of approximately equal size, and the chunks are assigned to threads in round-robin fashion in order of thread number. The proposed scheme instead allocates iterations according to the loop load. In particular, for a load-decreasing loop, the new static scheduling scheme aims to calculate the load of each assigned chunk so that work stays balanced across the loop. A heuristic area calculation method can be used to calculate the load of each chunk. As a result, the new static scheduling scheme improves load balance and so helps ensure the parallel efficiency of OpenMP programs.
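The idea of balancing chunks by area can be illustrated with a minimal sketch. It is not the authors' method, since the abstract does not specify their heuristic. It assumes, hypothetically, that the load of iteration i in a loop of n iterations is proportional to n - i (as in a triangular loop nest), so the cumulative load up to position x is n*x - x^2/2. The function names `equal_area_bounds` and `chunk_loads` are made up for this illustration.

```python
import math


def equal_area_bounds(n, num_threads):
    """Split iterations [0, n) into contiguous chunks of roughly equal load.

    Assumption (not from the paper): load of iteration i is proportional to n - i.
    Returns num_threads + 1 boundaries; chunk k covers [bounds[k], bounds[k + 1]).
    """
    bounds = [0]
    for k in range(1, num_threads):
        # Solve n*x - x^2/2 = (k / num_threads) * n^2/2 for x,
        # i.e. the point where a fraction k/num_threads of the area is covered.
        x = n * (1.0 - math.sqrt(1.0 - k / num_threads))
        # Round to a whole iteration and keep the boundaries non-decreasing.
        bounds.append(min(n, max(bounds[-1], round(x))))
    bounds.append(n)
    return bounds


def chunk_loads(n, bounds):
    """Total assumed load (sum of n - i) of each chunk defined by bounds."""
    return [sum(n - i for i in range(a, b)) for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    n, threads = 1000, 4
    balanced = equal_area_bounds(n, threads)
    equal_size = [n * k // threads for k in range(threads + 1)]
    print("equal-area bounds:", balanced, chunk_loads(n, balanced))
    print("equal-size bounds:", equal_size, chunk_loads(n, equal_size))
```

Under this assumed load model, the early chunks are short and the late chunks are long, whereas equal-size chunks give the first thread several times the work of the last. A real compiler would need a load estimate derived from the loop body rather than the fixed n - i model used here.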
Pages: 30-33
Number of pages: 4