A Scalable Architecture for Ordered Parallelism

被引:44
|
作者
Jeffrey, Mark C. [1 ]
Subramanian, Suvinay [1 ]
Yan, Cong [1 ]
Emer, Joel [2 ]
Sanchez, Daniel [1 ]
机构
[1] MIT, CSAIL, Cambridge, MA 02139 USA
[2] MIT, CSAIL, NVIDIA, Cambridge, MA 02139 USA
来源
PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48) | 2015年
基金
加拿大自然科学与工程研究理事会; 美国国家科学基金会;
关键词
Multicore; ordered parallelism; irregular parallelism; fine-grain parallelism; synchronization; speculative execution;
D O I
10.1145/2830772.2830777
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We present Swarm, a novel architecture that exploits ordered irregular parallelism, which is abundant but hard to mine with current software and hardware techniques. In this architecture, programs consist of short tasks with programmer-specified timestamps. Swarm executes tasks speculatively and out of order, and efficiently speculates thousands of tasks ahead of the earliest active task to uncover ordered parallelism. Swarm builds on prior TLS and HTM schemes, and contributes several new techniques that allow it to scale to large core counts and speculation windows, including a new execution model, speculation-aware hardware task management, selective aborts, and scalable ordered commits. We evaluate Swarm on graph analytics, simulation, and database benchmarks. At 64 cores, Swarm achieves 51-122x speedups over a single-core system, and out-performs software-only parallel algorithms by 3-18x.
引用
收藏
页码:228 / 241
页数:14
相关论文
共 50 条
  • [1] A Scalable Architecture for Reprioritizing Ordered Parallelism
    Posluns, Gilead
    Zhu, Yan
    Zhang, Guowei
    Jeffrey, Mark C.
    PROCEEDINGS OF THE 2022 THE 49TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '22), 2022, : 437 - 453
  • [2] A turbo decoder architecture with scalable parallelism
    Engin, N
    2004 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS DESIGN AND IMPLEMENTATION, PROCEEDINGS, 2004, : 298 - 303
  • [3] UNLOCKING ORDERED PARALLELISM WITH THE SWARM ARCHITECTURE
    Jeffrey, Mark C.
    Subramanian, Suvinay
    Yan, Cong
    Emer, Joel
    Sanchez, Daniel
    IEEE MICRO, 2016, 36 (03) : 105 - 117
  • [4] SCALABLE PARALLELISM BY EVOLUTIONARY ALGORITHMS
    HOFFMEISTER, F
    LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1991, 367 : 177 - 198
  • [5] Scalable Coordination of Hierarchical Parallelism
    Devadas, Vinay
    Curtis-Maury, Matthew
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [6] DAOS - Scalable And-Or parallelism
    Castro, LF
    Costa, VS
    Geyer, CFR
    Silva, F
    Vargas, PK
    Correia, ME
    EURO-PAR'99: PARALLEL PROCESSING, 1999, 1685 : 899 - 908
  • [7] Implicit Parallelism with Ordered Transactions
    von Praun, Christoph
    Ceze, Luis
    Cascaval, Calin
    PROCEEDINGS OF THE 2007 ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING PPOPP'07, 2007, : 79 - 89
  • [8] Scalable instruction-level parallelism
    Jesshope, C
    COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, 2004, 3133 : 383 - 392
  • [9] Scalable heterogeneous parallelism for atmospheric modeling and simulation
    Linford, John C.
    Sandu, Adrian
    JOURNAL OF SUPERCOMPUTING, 2011, 56 (03): : 300 - 327
  • [10] Support for scalable video using parallelism with merging
    Watkins, AR
    STORAGE AND RETRIEVAL FOR STILL IMAGE AND VIDEO DATABASES IV, 1996, 2670 : 274 - 285