Hardware and Software Co-Design for Optimized Decoding Schemes and Application Mapping in NVM Compute-in-Memory Architectures

被引:0
|
作者
Siddaramu, Shanmukha Mangadahalli [1 ]
Nezhadi, Ali [1 ]
Mayahinia, Mahta [1 ]
Ghasemi, Seyedehmaryam [1 ]
Tahoori, Mehdi B. [1 ]
机构
[1] Karlsruhe Inst Technol, Dept Comp Sci, D-76131 Karlsruhe, Germany
关键词
Power demand; Nonvolatile memory; System performance; Systems architecture; Data processing; Software; Decoding; Sensors; Arrays; Optimization; Binary tree data structure; computation-in-memory (CiM); decoder; gem5; latch;
D O I
10.1109/TCAD.2024.3447216
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The computation-in nonvolatile memory (NVM-CiM) approach addresses the growing computational demands and the memory-wall problem faced by traditional processor-centric architectures. Computation-in-memory (CiM) capitalizes on the parallel nature of memory arrays enabling effective computation through multirow memristor reading and sensing. In this context, the conventional design of memory decoders needs to be accordingly modified for efficient multirow activation and parallel data processing. This article presents the design and optimization of address decoders for NVM-CiM system architectures, employing a cross-layer co-optimization approach that integrates circuit and architecture design with application requirements. Our methodology starts at the circuit level, examining various decoder designs, including cascaded, hierarchical, latched, and hybrid models. An in-depth application-level characterization follows, utilizing an extended NVM-CiM-capable gem5 simulator to assess the impact of these decoders on the mapping of CiM-friendly applications and the resulting system performance, particularly in facilitating rapid and efficient activation of multirow memory configurations. This holistic analysis allows us to identify the bottlenecks and requirements from the application side and adjust the design of the decoder accordingly. Our analysis reveals that Hybrid Decoders significantly decrease latency and power consumption compared to other decoder designs within NVM-CiM systems. This highlights the crucial role of the decoder's row selection flexibility, reducing additional system-level data movement even at the expense of its performance, can substantially improve the overall efficiency of NVM-CiM systems.
引用
收藏
页码:3744 / 3755
页数:12
相关论文
共 47 条
  • [41] RAINBOW: Multi-Dimensional Hardware-Software Co-Design for DL Accelerator On-Chip Memory
    Zouzoula, Stavroula
    Azhar, Muhammad Waqar
    Trancoso, Pedro
    2023 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, ISPASS, 2023, : 352 - 354
  • [42] Packing Multiple Types of Cores for Energy-Optimized Heterogeneous Hardware-Software Co-Design of Moldable Streaming Computations
    Litzinger, Sebastian
    Keller, Jorg
    Kessler, Christoph
    IEEE ACCESS, 2023, 11 : 19301 - 19311
  • [43] Optimizing hardware-software co-design based on non-ideality in memristor crossbars for in-memory computing
    Jiang, Pinfeng
    Song, Danzhe
    Huang, Menghua
    Yang, Fan
    Wang, Letian
    Liu, Pan
    Miao, Xiangshui
    Wang, Xingsheng
    SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (02)
  • [44] Optimizing hardware-software co-design based on non-ideality in memristor crossbars for in-memory computing
    Pinfeng JIANG
    Danzhe SONG
    Menghua HUANG
    Fan YANG
    Letian WANG
    Pan LIU
    Xiangshui MIAO
    Xingsheng WANG
    Science China(Information Sciences), 2025, 68 (02) : 354 - 369
  • [45] Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design
    Talati, Nishil
    May, Kyle
    Behroozi, Armand
    Yang, Yichen
    Kaszyk, Kuba
    Vasiladiotis, Christos
    Verma, Tarunesh
    Li, Lu
    Nguyen, Brandon
    Sun, Jiawen
    Morton, John Magnus
    Ahmadi, Agreen
    Austin, Todd
    O'Boyle, Michael
    Mahlke, Scott
    Mudge, Trevor
    Dreslinski, Ronald
    2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 654 - 667
  • [46] SpinBayes: Algorithm-Hardware Co-Design for Uncertainty Estimation Using Bayesian In-Memory Approximation on Spintronic-Based Architectures
    Ahmed, Soyed Tuhin
    Danouchi, Kamal
    Hefenbrock, Michael
    Prenat, Guillaume
    Anghel, Lorena
    Tahoori, Mehdi B.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (05)
  • [47] Co-design of application software and NAND flash memory in solid-state drive for relational database storage system
    Miyaji, Kousuke
    Sun, Chao
    Soga, Ayumi
    Takeuchi, Ken
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2014, 53 (04)