Hardware and Software Co-Design for Optimized Decoding Schemes and Application Mapping in NVM Compute-in-Memory Architectures

被引:0
|
作者
Siddaramu, Shanmukha Mangadahalli [1 ]
Nezhadi, Ali [1 ]
Mayahinia, Mahta [1 ]
Ghasemi, Seyedehmaryam [1 ]
Tahoori, Mehdi B. [1 ]
机构
[1] Karlsruhe Inst Technol, Dept Comp Sci, D-76131 Karlsruhe, Germany
关键词
Power demand; Nonvolatile memory; System performance; Systems architecture; Data processing; Software; Decoding; Sensors; Arrays; Optimization; Binary tree data structure; computation-in-memory (CiM); decoder; gem5; latch;
D O I
10.1109/TCAD.2024.3447216
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The computation-in nonvolatile memory (NVM-CiM) approach addresses the growing computational demands and the memory-wall problem faced by traditional processor-centric architectures. Computation-in-memory (CiM) capitalizes on the parallel nature of memory arrays enabling effective computation through multirow memristor reading and sensing. In this context, the conventional design of memory decoders needs to be accordingly modified for efficient multirow activation and parallel data processing. This article presents the design and optimization of address decoders for NVM-CiM system architectures, employing a cross-layer co-optimization approach that integrates circuit and architecture design with application requirements. Our methodology starts at the circuit level, examining various decoder designs, including cascaded, hierarchical, latched, and hybrid models. An in-depth application-level characterization follows, utilizing an extended NVM-CiM-capable gem5 simulator to assess the impact of these decoders on the mapping of CiM-friendly applications and the resulting system performance, particularly in facilitating rapid and efficient activation of multirow memory configurations. This holistic analysis allows us to identify the bottlenecks and requirements from the application side and adjust the design of the decoder accordingly. Our analysis reveals that Hybrid Decoders significantly decrease latency and power consumption compared to other decoder designs within NVM-CiM systems. This highlights the crucial role of the decoder's row selection flexibility, reducing additional system-level data movement even at the expense of its performance, can substantially improve the overall efficiency of NVM-CiM systems.
引用
收藏
页码:3744 / 3755
页数:12
相关论文
共 47 条
  • [21] Operating System Support to an Online Hardware-Software Co-Design Scheduler for Heterogeneous Multicore Architectures
    Bueno, Maikon A. F.
    de Holanda, Jose A. M.
    Pereira, Erinaldo
    Marques, Eduardo
    2014 IEEE 20TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2014,
  • [22] Hardware/Software Co-Design With ADC-Less In-Memory Computing Hardware for Spiking Neural Networks
    Apolinario, Marco Paul E.
    Kosta, Adarsh Kumar
    Saxena, Utkarsh
    Roy, Kaushik
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (01) : 35 - 47
  • [23] ParaGraph: An application-simulator interface and toolkit for hardware-software co-design
    Isaev, Mikhail
    McDonald, Nic
    Young, Jeffrey
    Vuduc, Richard
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
  • [25] GANDALF: A Fine-Grained Hardware-Software Co-Design for Preventing Memory Attacks
    Krishnakumar, Gnanambikai
    Patanjali, S. L. P. S. K.
    Vairam, Prasanna Karthik
    Rebeiro, Chester
    Veezhinathan, Kamakoti
    IEEE EMBEDDED SYSTEMS LETTERS, 2018, 10 (03) : 83 - 86
  • [26] TransPIM: A Memory-based Acceleration via Software-Hardware Co-Design for Transformer
    Zhou, Minxuan
    Xu, Weihong
    Kang, Jaeyoung
    Rosing, Tajana
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 1071 - 1085
  • [27] Exploring Hybrid Memory for GPU Energy Efficiency through Software-Hardware Co-Design
    Wang, Bin
    Wu, Bo
    Li, Dong
    Shen, Xipeng
    Yu, Weikuan
    Jiao, Yizheng
    Vetter, Jeffrey S.
    2013 22ND INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT), 2013, : 93 - 102
  • [28] Optimized Hardware-Software Co-Design for Kyber and Dilithium on RISC-V SoC FPGA
    Wang, Tengfei
    Zhang, Chi
    Zhang, Xiaolin
    Gu, Dawu
    Cao, Pei
    IACR Transactions on Cryptographic Hardware and Embedded Systems, 2024, 2024 (03): : 99 - 135
  • [29] ML-HW Co-Design of Noise-Robust TinyML Models and Always-On Analog Compute-in-Memory Edge Accelerator
    Zhou, Chuteng
    Redondo, Fernando Garcia
    Buchel, Julian
    Boybat, Irem
    Comas, Xavier Timoneda
    Nandakumar, S. R.
    Das, Shidhartha
    Sebastian, Abu
    Le Gallo, Manuel
    Whatmough, Paul N.
    IEEE MICRO, 2022, 42 (06) : 76 - 87
  • [30] The Use of Fair Y-Sim for Optimizing Mapping Set Selection in Hardware/Software Co-Design
    Adeluyi, Olufemi
    Kim, Eun-ok
    Lee, Jeong-A
    Lee, Jeong-Gun
    ISOCC: 2008 INTERNATIONAL SOC DESIGN CONFERENCE, VOLS 1-3, 2008, : 613 - +