Combine Thread with Memory Scheduling for Maximizing Performance in Multi-core Systems

被引:0
|
作者
Jia, Gangyong [1 ]
Han, Guangjie [2 ]
Shi, Liang [3 ]
Wan, Jian [1 ]
Dai, Dong [4 ]
机构
[1] Hangzhou Dianzi Univ, Dept Comp Sci & Technol, Hangzhou 310018, Zhejiang, Peoples R China
[2] Hohai Univ, Dept Comp Sci, Changzhou 213022, Peoples R China
[3] Chongqing Univ, Dept Comp Sci & Technol, Chongqing 400044, Peoples R China
[4] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
基金
美国国家科学基金会;
关键词
Thread scheduling; memory scheduling; memory interference; memory access time; performance; energy;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The growing gap between microprocessor speed and DRAM speed is a major problem that computer designers are facing. In order to narrow the gap, it is necessary to improve DRAM's speed and throughput. Moreover, on multi-core platforms, DRAM memory shared by all cores usually suffers from the memory contention and interference problem, which can cause serious performance degradation and unfairness among parallel running threads. To address these problems, this paper proposes techniques to take both advantages of partitioning cores, threads and memory banks into groups to reduce interference among different groups and grouping the memory accesses of the same row together to reduce cache miss rate. A memory optimization framework combined thread scheduling with memory scheduling (CTMS) is proposed in this paper, which simultaneously minimizes memory access schedule length, memory access time and reduce interference to maximize performance for multi-core systems. Experimental results show CTMS is 12.6% shorter in memory access time, while improving 11.8% throughput on average. Moreover, CTMS also saves 5.8% of the energy consumption.
引用
收藏
页码:298 / 305
页数:8
相关论文
共 50 条
  • [31] I/O scheduling and performance analysis on multi-core platforms
    Liu, Zhaobin
    Qu, Wenyu
    Li, Haitao
    Ruan, Min
    Zhou, Wanlei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2009, 21 (10): : 1405 - 1417
  • [32] Graph Support and Scheduling for OpenCL on Heterogeneous Multi-core Systems
    Chien, Shih-Huan
    Chang, Yuan-Ming
    Yang, Chun-Chieh
    Hwang, Yuan-Shin
    Lee, Jenq-Kuen
    47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP '18), 2018,
  • [33] Maximizing Throughput of Temperature-Constrained Multi-Core Systems with 3D-Stacked Cache Memory
    Kang, Kyungsu
    Jung, Jongpil
    Yoo, Sungjoo
    Kyung, Chong-Min
    2011 12TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2011, : 577 - 582
  • [34] Exploitation of Nested Thread-Level Speculative Parallelism on Multi-Core Systems
    Kejariwal, Arun
    Girkar, Milind
    Tian, Xinmin
    Saito, Hideki
    Nicolau, Alexandru
    Veidenbaum, Alexander V.
    Banerjee, Utpal
    Polychronopoulos, Constantine D.
    PROCEEDINGS OF THE 2010 COMPUTING FRONTIERS CONFERENCE (CF 2010), 2010, : 99 - 100
  • [35] Thread Motion: Fine-Grained Power Management for Multi-Core Systems
    Rangan, Krishna K.
    Wei, Gu-Yeon
    Brooks, David
    ISCA 2009: 36TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2009, : 302 - 313
  • [36] iBuddy: Inverse Buddy for Enhancing Memory Allocation/Deallocation Performance on Multi-Core Systems
    Park, Heekwon
    Choi, Jongmoo
    Lee, Donghee
    Noh, Sam H.
    IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (03) : 720 - 732
  • [37] Memory Access Pattern-Aware DRAM Performance Model for Multi-Core Systems
    Choi, Hyojin
    Lee, Jongbok
    Sung, Wonyong
    IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2011), 2011, : 66 - 75
  • [38] PPT: Joint Performance/Power/Thermal Management of DRAM Memory for Multi-Core Systems
    Lin, Chung-Hsiang
    Yang, Chia-Lin
    King, Ku-Jei
    ISLPED 09, 2009, : 93 - 98
  • [39] Modeling Memory Concurrency for Multi-Socket Multi-Core Systems
    Mandal, Anirban
    Fowler, Rob
    Porterfield, Allan
    2010 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2010), 2010, : 66 - 75
  • [40] Thread Lock Section-Aware Scheduling on Asymmetric Single-ISA Multi-Core
    Markovic, Nikola
    Nemirovsky, Daniel
    Unsal, Osman
    Valero, Mateo
    Cristal, Adrian
    IEEE COMPUTER ARCHITECTURE LETTERS, 2015, 14 (02) : 160 - 163