A Comprehensive Performance Evaluation to GPGPU Applications under STT-RAM based Hybrid Cache Architectures

Authors
Fu, Jingjing [1]
Liu, Yu [1]
Affiliations
[1] Clarkson Univ, Dept Elect & Comp Engn, 8 Clarkson Ave, Potsdam, NY 13699 USA
Keywords
GPGPU; STT-RAM; Cache; Timing; Energy Consumption; Soft Error Resilience;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
General-purpose graphics processing units (GPGPUs) have become the prevailing platform for complex scientific and engineering computing in the exascale era, owing to their capability for massively parallel computing on a many-core architecture. At the same time, the probability of soft errors caused by particle strikes has risen significantly in large-scale computing systems built from GPGPUs. Spin-Transfer Torque RAM (STT-RAM) stores information in a Magnetic Tunnel Junction (MTJ), which makes it immune to soft errors and therefore a feasible soft-error-resilient solution. However, STT-RAM suffers from high latency and energy overhead on write operations, which has made designers hesitant to adopt it in memory system design. A comprehensive performance evaluation of adopting STT-RAM in the memory hierarchy of the GPGPU architecture (i.e., hybrid STT-RAM/SRAM cache architectures) is therefore necessary. This work offers a fair and comprehensive performance evaluation of GPGPU applications across different cache associativities and multiple plans for partially or completely adopting STT-RAM in the GPGPU memory hierarchy, which could offer useful options for soft-error-resilient GPGPU architecture design. In addition, this work shows that a proper combination of cache configuration and adoption plan may result in only a slight drop in timing performance and equivalent energy consumption, while gaining the benefit of soft error resilience.
Pages: 8