Data Marshaling for Multi-core Architectures

被引:0
|
作者
Suleman, M. Aater [1 ]
Mutlu, Onur
Joao, Jose A. [1 ]
Khubaib [1 ]
Patt, Yale N. [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
来源
ISCA 2010: THE 37TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE | 2010年
关键词
Staged Execution; Critical Sections; Pipelining; CMP;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Previous research has shown that Staged Execution (SE), i.e., dividing a program into segments and executing each segment at the core that has the data and/or functionality to best run that segment, can improve performance and save power. However, SE's benefit is limited because most segments access inter-segment data, i.e., data generated by the previous segment. When consecutive segments run on different cores, accesses to inter-segment data incur cache misses, thereby reducing performance. This paper proposes Data Marshaling (DM), a new technique to eliminate cache misses to inter-segment data. DM uses profiling to identify instructions that generate inter-segment data, and adds only 96 bytes/core of storage overhead. We show that DM significantly improves the performance of two promising Staged Execution models, Accelerated Critical Sections and producer-consumer pipeline parallelism, on both homogeneous and heterogeneous multi-core systems. In both models, DM can achieve almost all of the potential of ideally eliminating cache misses to inter-segment data. DM's performance benefit increases with the number of cores.
引用
收藏
页码:441 / 450
页数:10
相关论文
共 50 条
  • [31] Sparse matrix operations on several multi-core architectures
    Trinitis, Carsten
    Kuestner, Tilman
    Weidendorfer, Josef
    Smajic, Jasmin
    JOURNAL OF SUPERCOMPUTING, 2011, 57 (02): : 132 - 140
  • [32] Automated Tuning in Parallel Sorting on Multi-core Architectures
    Lin, Haibo
    Li, Chao
    Wang, Qian
    Zhao, Yi
    Pan, Ninghe
    Zhuang, Xiaotong
    Shao, Ling
    EURO-PAR 2010 PARALLEL PROCESSING, PT I, 2010, 6271 : 14 - +
  • [33] MODELS FOR PARALLEL WORKFLOW PROCESSING ON MULTI-CORE ARCHITECTURES
    Rauber, Thomas
    Ruenger, Gudula
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL ISAS-2: INFORMATION SYSTEMS ANALYSIS AND SPECIFICATION, VOL 2, 2008, : 220 - +
  • [34] Optimizing process creation and execution on multi-core architectures
    Kulkarni, Abhishek
    Ionkov, Latchesar
    Lang, Michael
    Lumsdaine, Andrew
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2013, 27 (02): : 147 - 161
  • [35] Automatic Parallelization of Simulink Models for Multi-core Architectures
    Tuncali, Cumhur Erkan
    Fainekos, Georgios
    Lee, Yann-Hang
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 964 - 971
  • [36] Performance issues in emerging homogeneous multi-core architectures
    Kayi, Abdullah
    El-Ghazawi, Tarek
    Newby, Gregory B.
    SIMULATION MODELLING PRACTICE AND THEORY, 2009, 17 (09) : 1485 - 1499
  • [37] Practice of Parallelizing Network Applications on Multi-core Architectures
    Wang, Junchang
    Cheng, Haipeng
    Hua, Bei
    Tang, Xinan
    ICS'09: PROCEEDINGS OF THE 2009 ACM SIGARCH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, 2009, : 204 - 213
  • [38] A parallel algorithm for coverage optimization on multi-core architectures
    Wei, Ran
    Murray, Alan T.
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2016, 30 (03) : 432 - 450
  • [39] Hardware Supported Time Synchronization in Multi-Core Architectures
    Lynch, Elizabeth Whitaker
    Riley, George F.
    PADS 2009: 23RD WORKSHOP ON PRINCIPLES OF ADVANCED AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2009, : 88 - 94
  • [40] Interconnection Network Performance of Multi-core Cluster Architectures
    Hamid, Norhazlina
    Walters, Robert
    Wills, Gary
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS, AND CONTROL TECHNOLOGY (I4CT), 2015,