Exploiting SIMD Asymmetry in ARM-to-x86 Dynamic Binary Translation

被引:6
|
作者
Liu, Yu-Ping [1 ]
Hong, Ding-Yong [2 ]
Wu, Jan-Jan [2 ]
Fu, Sheng-Yu [1 ]
Hsu, Wei-Chung [1 ]
机构
[1] Natl Taiwan Univ, 1,Sect 4,Roosevelt Rd, Taipei 10617, Taiwan
[2] Acad Sinica, 128,Sect 2,Acad Rd, Taipei 11529, Taiwan
关键词
Dynamic binary translation; SIMD; SLP vectorization;
D O I
10.1145/3301488
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Single instruction multiple data (SIMD) has been adopted for decades because of its superior performance and power efficiency. The SIMD capability (i.e., width, number of registers, and advanced instructions) has diverged rapidly on different SIMD instruction-set architectures (ISAs). Therefore, migrating existing applications to another host ISA that has fewer but longer SIMD registers and more advanced instructions raises the issues of asymmetric SIMD capability. To date, this issue has been overlooked and the host SIMD capability is underutilized, resulting in suboptimal performance. In this article, we present a novel binary translation technique called spill-aware superword level parallelism (saSLP), which combines short ARMv8 instructions and registers in the guest binaries to exploit the x86 AVX2 host's parallelism, register capacity, and gather instructions. Our experiment results show that saSLP improves the performance by 1.6x (2.3x) across a number of benchmarks and reduces spilling by 97% (99%) for ARMv8 to x86 AVX2 (AVX-512) translation. Furthermore, with AVX2 (AVX-512) gather instructions, saSLP speeds up several data-irregular applications that cannot be vectorized on ARMv8 NEON by up to 3.9x (4.2x).
引用
收藏
页数:24
相关论文
共 12 条
  • [1] Exploiting Asymmetric SIMD Register Configurations in ARM-to-x86 Dynamic Binary Translation
    Liu, Yu-Ping
    Hong, Ding-Yong
    Wu, Jan-Jan
    Fu, Sheng-Yu
    Hsu, Wei-Chung
    2017 26TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2017, : 343 - 355
  • [2] Exploiting Longer SIMD Lanes in Dynamic Binary Translation
    Hong, Ding-Yong
    Fu, Sheng-Yu
    Liu, Yu-Ping
    Wu, Jan-Jan
    Hsu, Wei-Chung
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 853 - 860
  • [3] Optimizing dynamic binary translation for SIMD instructions
    Li, Jianhui
    Zhang, Qi
    Xu, Shu
    Huang, Bo
    CGO 2006: 4TH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2006, : 269 - +
  • [4] Efficient and retargetable SIMD translation in a dynamic binary translator
    Fu, Sheng-Yu
    Hong, Ding-Yong
    Liu, Yu-Ping
    Wu, Jan-Jan
    Hsu, Wei-Chung
    SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (06): : 1312 - 1330
  • [5] Improving SIMD Parallelism via Dynamic Binary Translation
    Hong, Ding-Yong
    Liu, Yu-Ping
    Fu, Sheng-Yu
    Wu, Jan-Jan
    Hsu, Wei-Chung
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (03)
  • [6] Exploiting Vector Processing in Dynamic Binary Translation
    Lin, Chih-Min
    Fu, Sheng-Yu
    Hong, Ding-Yong
    Liu, Yu-Ping
    Wu, Jan-Jan
    Hsu, Wei-Chung
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [7] Low Overhead Dynamic Binary Translation on ARM
    d'Antras, Amanieu
    Gorgovan, Cosmin
    Garside, Jim
    Lujan, Mikel
    ACM SIGPLAN NOTICES, 2017, 52 (06) : 333 - 346
  • [8] Disposing X86 FPU stack in binary translation
    Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China
    不详
    Jisuanji Yanjiu yu Fazhan, 2007, 11 (1946-1954):
  • [9] Speeding-up SIMD instructions Dynamic Binary Translation in Embedded Processor Simulation
    Michel, Luc
    Fournel, Nicolas
    Petrot, Frederic
    2011 DESIGN, AUTOMATION & TEST IN EUROPE (DATE), 2011, : 277 - 280
  • [10] Work-in-Progress: Exploiting SIMD Capability in an ARMv7-to-ARMv8 Dynamic Binary Translator
    Fu, Sheng-Yu
    Lin, Chih-Min
    Hong, Ding-Yong
    Liu, Yu-Ping
    Wu, Jan-Jan
    Hsu, Wei-Chung
    2018 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2018,