Efficient Utilization of SIMD Engines for General-Purpose Processors

被引:1
|
作者
Huang, Libo [1 ,2 ]
Wang, Zhiying [1 ,2 ]
Xiao, Nong [1 ,2 ]
Dou, Qiang [2 ]
机构
[1] Natl Univ Def Technol, State Key Lab High Performance Comp, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp, Changsha 410073, Hunan, Peoples R China
来源
COMPUTER JOURNAL | 2014年 / 57卷 / 08期
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
SIMD engine; general purpose processor; utilization; MULTIMEDIA; PERFORMANCE;
D O I
10.1093/comjnl/bxt049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes stream model execution (SME), a new architectural technique that supports the efficient utilization of single-instruction multiple-data (SIMD) engines on general-purpose processors. SME adopts lightweight streaming processing as the intermediate representation. This process uses a hardware-software co-design approach to support efficient SIMD compilation and utilization. From a hardware perspective, the micro-architecture of SIMD engines is enabled for stream support to provide the key components to address the limitations of SIMD utilization, such as data rearrangement and non-consecutive memory access. Thus, sequential code can be compiled efficiently through a two-step procedure (i.e. Ccode -> SMECcode -> vectorized code) and then executed by the SME hardware. Our experimental evaluation shows that SME can outperform previous optimization techniques by an average of 58% for multimedia kernels and 26% for applications.
引用
收藏
页码:1141 / 1154
页数:14
相关论文
共 50 条
  • [31] Benchmark Research on Computing Performance of Domestic General-purpose Processors
    Liu, Bowen
    Fan, Kefeng
    Chen, Hai
    Liu, Lixin
    Yang, Huihua
    2022 6TH INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, ISCSIC, 2022, : 357 - 362
  • [32] FUZZY RULE-BASED SYSTEMS ON GENERAL-PURPOSE PROCESSORS
    SURMANN, H
    UNGERING, AP
    IEEE MICRO, 1995, 15 (04) : 40 - 48
  • [33] FASTCRYPTO: PARALLEL AES PIPELINES EXTENSION FOR GENERAL-PURPOSE PROCESSORS
    Soliman, Mostafa I.
    Abozaid, Ghada Y.
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 937 - 944
  • [34] Engines of power: Electricity, AI, and general-purpose, military transformations
    Ding, Jeffrey
    Dafoe, Allan
    EUROPEAN JOURNAL OF INTERNATIONAL SECURITY, 2023, 8 (03) : 377 - 394
  • [35] Web service discovery using general-purpose search engines
    Song, Henry
    Cheng, Doreen
    Messer, Alan
    Kalasapur, Swaroop
    2007 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2007, : 265 - +
  • [36] A general-purpose processor-per-pixel analog SIMD vision chip
    Dudek, P
    Hicks, PJ
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2005, 52 (01) : 13 - 20
  • [37] EFFICIENT GENERAL-PURPOSE PARALLEL COMPUTER.
    Galil, Zvi
    Paul, Wolfang J.
    Journal of the ACM, 1983, 30 (02): : 360 - 387
  • [38] Novel Efficient HEVC Decoding Solution on General-Purpose Processors (vol 16, pg 1915, 2014)
    Duan, Yizhou
    Sun, Jun
    Yan, Leju
    Chen, Keji
    Guo, Zongming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (02) : 257 - 257
  • [39] Attaining High Performance in General-Purpose Computations on Current Graphics Processors
    Igual, Francisco D.
    Mayo, Rafael
    Quintana-Orti, Enrique S.
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2008, 2008, 5336 : 406 - 419
  • [40] An Overflow-free Quantized Memory Hierarchy in General-purpose Processors
    Lenjani, Marzieh
    Gonzalez, Patricia
    Sadredini, Elaheh
    Rahman, M. Arif
    Stan, Mircea R.
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC 2019), 2019, : 203 - 215