Adaptively scheduling parallel loops in distributed shared-memory systems

被引:33
|
作者
Yan, Y [1 ]
Jin, CM [1 ]
Zhang, XD [1 ]
机构
[1] INTERVOICE INC,DALLAS,TX
基金
美国国家科学基金会;
关键词
adaptive scheduling algorithms; dynamic information; load balancing; parallel loops; processor affinity; shared-memory systems;
D O I
10.1109/71.569656
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Using runtime information of load distributions and processor affinity, we propose an adaptive scheduling algorithm and its variations from different control mechanisms. The proposed algorithm applies different degrees of aggressiveness to adjust loop scheduling granularities, aiming at improving the execution performance of parallel loops by making scheduling decisions that match the real workload distributions at runtime. We experimentally compared the performance of our algorithm and its variations with several existing scheduling algorithms on two parallel machines: the KSR-1 and the Convex Exemplar. The kernel application programs we used for performance evaluation were carefully selected for different classes of parallel loops. Our results show that using runtime information to adaptively adjust scheduling granularity is an effective way to handle loops with a wide range of load distributions when no prior knowledge of the execution can be used. The overhead caused by collecting runtime information is insignificant in comparison with the performance improvement. Our experiments show that the adaptive algorithm and its five variations outperformed the existing scheduling algorithms.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 50 条
  • [21] Parallel image processing for line detection in shared-memory and distributed environments
    Kyrki, V
    Ikonen, J
    Porras, J
    Kälviäinen, H
    INTELLIGENT ROBOTS AND COMPUTER VISION XIX: ALGORITHMS, TECHNIQUES, AND ACTIVE VISION, 2000, 4197 : 89 - 97
  • [22] A shared-memory multiprocessor scheduling algorithm
    Zuccar, Irene
    Solar, Mauricio
    Kri, Fernanda
    Parada, Victor
    PROFESSIONAL PRACTICE IN ARTIFICIAL INTELLIGENCE, 2006, 218 : 313 - +
  • [23] MEMORY MANAGEMENT FOR PARALLEL TASKS IN SHARED-MEMORY
    LANGENDOEN, KG
    MULLER, HL
    VREE, WG
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 637 : 165 - 178
  • [24] Memory latency in distributed shared-memory multiprocessors
    Motlagh, BS
    DeMara, RF
    PROCEEDINGS IEEE SOUTHEASTCON '98: ENGINEERING FOR A NEW ERA, 1998, : 134 - 137
  • [25] Parallel Data Mining for Association Rules on Shared-Memory Systems
    S. Parthasarathy
    M. J. Zaki
    M. Ogihara
    W. Li
    Knowledge and Information Systems, 2001, 3 (1) : 1 - 29
  • [26] Scalable Parallel Fault Simulation for Shared-Memory Multiprocessor Systems
    Hadjitheophanous, Stavros
    Neophytou, Stelios N.
    Michael, Maria K.
    2016 IEEE 34TH VLSI TEST SYMPOSIUM (VTS), 2016,
  • [27] LIBMF: A Library for Parallel Matrix Factorization in Shared-memory Systems
    Chin, Wei-Sheng
    Yuan, Bo-Wen
    Yang, Meng-Yuan
    Zhuang, Yong
    Juan, Yu-Chin
    Lin, Chih-Jen
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [28] COMPARING DISTRIBUTED-MEMORY AND VIRTUAL SHARED-MEMORY PARALLEL PROGRAMMING-MODELS
    KEANE, JA
    GRANT, AJ
    XU, MQ
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING AND ESCIENCE, 1995, 11 (02): : 233 - 243
  • [29] Predicting reconfigurable interconnect performance in distributed shared-memory systems
    Heirman, W.
    Dambre, J.
    Artundo, I.
    Debaes, C.
    Thienpont, H.
    Stroobandt, D.
    Van Campenhout, J.
    INTEGRATION-THE VLSI JOURNAL, 2007, 40 (04) : 382 - 393
  • [30] Analysis of failure recovery schemes for distributed shared-memory systems
    Kim, JH
    Vaidya, NH
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1999, 146 (03): : 125 - 130