An Efficient Vectorization Approach to Nested Thread-level Parallelism for CUDA GPUs

被引:0
|
作者
Xu, Shixiong [1 ]
Gregg, David [2 ]
机构
[1] Univ Dublin, Trinity Coll Dublin, Sch Comp Sci & Stat, Software Tools Grp, Dublin, Ireland
[2] Lero Irish Software Engn Res Ctr, Copenhagen, Denmark
来源
2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT) | 2015年
关键词
D O I
10.1109/PACT.2015.56
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
引用
收藏
页码:488 / 489
页数:2
相关论文
共 50 条
  • [21] Compiler-Driven Software Speculation for Thread-Level Parallelism
    Yiapanis, Paraskevas
    Brown, Gavin
    Lujan, Mikel
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2016, 38 (02):
  • [22] On the limitations of compilers to exploit thread-level parallelism in embedded applications
    Islam, Mafijul
    6TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2007, : 60 - 65
  • [23] Parallelization Spectroscopy: Analysis of Thread-level Parallelism in HPC Programs
    Kejariwal, Arun
    Cascaval, Calin
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 293 - 294
  • [24] Programming Matrix Algorithms-by-Blocks for Thread-Level Parallelism
    Quintana-Orti, Gregorio
    Quintana-Orti, Enrique S.
    Van de Geijn, Robert A.
    Van Zee, Field G.
    Chan, Ernie
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2009, 36 (03):
  • [25] Exploiting thread-level speculative parallelism with software value prediction
    Li, XF
    Yang, C
    Du, ZH
    Ngai, TF
    ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, PROCEEDINGS, 2005, 3740 : 367 - 388
  • [26] Parallelization spectroscopy: Analysis of thread-level parallelism in HPC programs
    Kejariwal, Arun
    Cascaval, Calin
    ACM SIGPLAN Notices, 2009, 44 (04): : 293 - 294
  • [27] Relational profiling: Enabling thread-level parallelism in virtual machines
    Heil, T
    Smith, JE
    33RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE: MICRO-33 2000, PROCEEDINGS, 2000, : 281 - 290
  • [28] Exploiting the thread-level parallelism for BGP on Multi-core
    Gao Lei
    Lai Mingche
    Gong Zhenghu
    CNSR 2008: PROCEEDINGS OF THE 6TH ANNUAL COMMUNICATION NETWORKS AND SERVICES RESEARCH CONFERENCE, 2008, : 510 - 516
  • [29] A scalable approach to Thread-Level Speculation
    Steffan, JG
    Colohan, CB
    Zhai, A
    Mowry, TC
    PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2000, : 1 - 12
  • [30] GODSON-T: AN EFFICIENT MANY-CORE PROCESSOR EXPLORING THREAD-LEVEL PARALLELISM
    Fan, Dongrui
    Zhang, Hao
    Wang, Da
    Ye, Xiaochun
    Song, Fenglong
    Li, Guojie
    Sun, Ninghui
    IEEE MICRO, 2012, 32 (02) : 38 - 47