Efficient Utilization of SIMD Engines for General-Purpose Processors

Cited by: 1
Authors
Huang, Libo [1, 2]
Wang, Zhiying [1, 2]
Xiao, Nong [1, 2]
Dou, Qiang [2]
Affiliations
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
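Source
COMPUTER JOURNAL | 2014 / Vol. 57 / No. 08
Funding
National Natural Science Foundation of China; National High Technology Research and Development Program of China (863 Program);
Keywords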
SIMD engine; general purpose processor; utilization; MULTIMEDIA; PERFORMANCE;
DOI
10.1093/comjnl/bxt049
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
This paper proposes stream model execution (SME), a new architectural technique that supports the efficient utilization of single-instruction multiple-data (SIMD) engines on general-purpose processors. SME adopts lightweight streaming processing as the intermediate representation. This process uses a hardware-software co-design approach to support efficient SIMD compilation and utilization. From a hardware perspective, the micro-architecture of SIMD engines is enabled for stream support to provide the key components to address the limitations of SIMD utilization, such as data rearrangement and non-consecutive memory access. Thus, sequential code can be compiled efficiently through a two-step procedure (i.e. C code -> SME C code -> vectorized code) and then executed by the SME hardware. Our experimental evaluation shows that SME can outperform previous optimization techniques by an average of 58% for multimedia kernels and 26% for applications.
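The two limitations named in the abstract, non-consecutive memory access and data rearrangement, can be illustrated with a minimal plain-C sketch. This is not SME code and not the authors' method; the interleaved RGB buffer and the function names (sum_red_strided, deinterleave_rgb, sum_contiguous) are hypothetical, chosen only for illustration.

```c
#include <stddef.h>

/* Hypothetical example, not taken from the paper.
 * Non-consecutive access: the red samples of an interleaved RGB buffer
 * sit 3 bytes apart, so one contiguous vector load would also pick up
 * the green and blue samples. */
unsigned sum_red_strided(const unsigned char *rgb, size_t pixels)
{
    unsigned sum = 0;
    for (size_t i = 0; i < pixels; ++i)
        sum += rgb[3 * i];          /* stride-3 load */
    return sum;
}

/* Data rearrangement: split the interleaved buffer into three planar
 * arrays so that each channel becomes contiguous in memory. */
void deinterleave_rgb(const unsigned char *rgb, size_t pixels,
                      unsigned char *r, unsigned char *g, unsigned char *b)
{
    for (size_t i = 0; i < pixels; ++i) {
        r[i] = rgb[3 * i];
        g[i] = rgb[3 * i + 1];
        b[i] = rgb[3 * i + 2];
    }
}

/* Unit-stride loop over one plane: the access pattern that maps
 * directly onto contiguous SIMD loads. */
unsigned sum_contiguous(const unsigned char *plane, size_t n)
{
    unsigned sum = 0;
    for (size_t i = 0; i < n; ++i)
        sum += plane[i];
    return sum;
}
```

Calling deinterleave_rgb and then sum_contiguous on the red plane gives the same result as sum_red_strided; the difference lies only in the memory access pattern, which is the kind of cost the abstract says the stream support is meant to address.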
Pages: 1141 - 1154
Page count: 14
Related papers
50 in total
  • [41] Mat-core: A matrix core extension for general-purpose processors
    Soliman, Mostafa I.
    2007 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS: ICCES '07, 2007 : 304 - 310
  • [42] Work in Progress - Course Development of Programming for General-Purpose Multicore Processors
    Zhang, Wei
    2011 FRONTIERS IN EDUCATION CONFERENCE (FIE), 2011
  • [43] A Tensor Marshaling Unit for Sparse Tensor Algebra on General-Purpose Processors
    Siracusa, Marco
    Soria-Pardos, Victor
    Sgherzi, Francesco
    Randall, Joshua
    Joseph, Douglas J.
    Moreto Planas, Miquel
    Armejach, Adria
    56TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2023, 2023 : 1332 - 1346
  • [44] GENERAL-PURPOSE OPTICAL-PATTERN RECOGNITION IMAGE-PROCESSORS
    CASASENT, D
    PROCEEDINGS OF THE IEEE, 1994, 82 (11) : 1724 - 1734
  • [45] A performance study of general-purpose applications on graphics processors using CUDA
    Che, Shuai
    Boyer, Michael
    Meng, Jiayuan
    Tarjan, David
    Sheaffer, Jeremy W.
    Skadron, Kevin
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (10) : 1370 - 1380
  • [46] General-purpose 128 x 128 SIMD processor array with integrated image sensor
    Dudek, P.
    Carey, S. J.
    ELECTRONICS LETTERS, 2006, 42 (12) : 678 - 679
  • [47] Compilation, architectural support, and evaluation of SIMD graphics pipeline programs on a general-purpose CPU
    Breternitz, M
    Hum, H
    Kumar, S
    12TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2003 : 135 - 145
  • [48] General-Purpose FPGA Platform for Efficient Encryption and Hashing
    Szefer, Jakub
    Chen, Yu-Yuan
    Lee, Ruby B.
    21ST IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2010
  • [49] Runtime reconfiguration techniques for efficient general-purpose computation
    Xu, BX
    Albonesi, DH
    IEEE DESIGN & TEST OF COMPUTERS, 2000, 17 (01) : 42 - 52
  • [50] An Efficient Formulation for General-Purpose Multibody/Multiphysics Analysis
    Masarati, Pierangelo
    Morandini, Marco
    Mantegazza, Paolo
    JOURNAL OF COMPUTATIONAL AND NONLINEAR DYNAMICS, 2014, 9 (04)