Efficient Utilization of SIMD Engines for General-Purpose Processors

被引:1
|
作者
Huang, Libo [1 ,2 ]
Wang, Zhiying [1 ,2 ]
Xiao, Nong [1 ,2 ]
Dou, Qiang [2 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
来源
COMPUTER JOURNAL | 2014年 / 57卷 / 08期
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
SIMD engine; general purpose processor; utilization; MULTIMEDIA; PERFORMANCE;
D O I
10.1093/comjnl/bxt049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes stream model execution (SME), a new architectural technique that supports the efficient utilization of single-instruction multiple-data (SIMD) engines on general-purpose processors. SME adopts lightweight streaming processing as the intermediate representation. This process uses a hardware-software co-design approach to support efficient SIMD compilation and utilization. From a hardware perspective, the micro-architecture of SIMD engines is enabled for stream support to provide the key components to address the limitations of SIMD utilization, such as data rearrangement and non-consecutive memory access. Thus, sequential code can be compiled efficiently through a two-step procedure (i.e. Ccode -> SMECcode -> vectorized code) and then executed by the SME hardware. Our experimental evaluation shows that SME can outperform previous optimization techniques by an average of 58% for multimedia kernels and 26% for applications.
引用
收藏
页码:1141 / 1154
页数:14
相关论文
共 50 条
  • [1] SparseTrain: Leveraging Dynamic Sparsity in Software for Training DNNs on General-Purpose SIMD Processors
    Gong, Zhangxiaowen
    Ji, Houxiang
    Fletcher, Christopher W.
    Hughes, Christopher J.
    Torrellas, Josep
    PACT '20: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2020, : 279 - 292
  • [2] Novel Efficient HEVC Decoding Solution on General-Purpose Processors
    Duan, Yizhou
    Sun, Jun
    Yan, Leju
    Chen, Keji
    Guo, Zongming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (07) : 1915 - 1928
  • [3] Implementing the 2-D wavelet transform on SIMD-enhanced general-purpose processors
    Shahbahrami, Asadollah
    Juurlink, Ben
    Vassiliadis, Stamatis
    IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (01) : 43 - 51
  • [4] Performance aspects of using various techniques of programming SIMD extensions of modern general-purpose processors
    Trocki, Krzysztof
    PROCEEDINGS OF THE 2008 1ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, 2008, : 413 - 416
  • [5] Multimedia extensions for general-purpose processors
    Lee, RB
    SIPS 97 - 1997 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS: DESIGN AND IMPLEMENTATION, 1997, : 9 - 23
  • [6] Multimedia enhanced general-purpose processors
    Wong, S
    Cotofana, S
    Vassiliadis, S
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1493 - 1496
  • [7] Memory Encryption for General-Purpose Processors
    Gueron, Shay
    IEEE SECURITY & PRIVACY, 2016, 14 (06) : 54 - 62
  • [9] Stream programming on general-purpose processors
    Gummaraju, J
    Rosenblum, M
    MICRO-38: Proceedings of the 38th Annual IEEE/ACM International Symposiumn on Microarchitecture, 2005, : 343 - 354
  • [10] GENERAL-PURPOSE TECHNOLOGIES - ENGINES OF GROWTH
    BRESNAHAN, TF
    TRAJTENBERG, M
    JOURNAL OF ECONOMETRICS, 1995, 65 (01) : 83 - 108