Hardware and Software Co-Design for Optimized Decoding Schemes and Application Mapping in NVM Compute-in-Memory Architectures

被引:0
|
作者
Siddaramu, Shanmukha Mangadahalli [1 ]
Nezhadi, Ali [1 ]
Mayahinia, Mahta [1 ]
Ghasemi, Seyedehmaryam [1 ]
Tahoori, Mehdi B. [1 ]
机构
[1] Karlsruhe Inst Technol, Dept Comp Sci, D-76131 Karlsruhe, Germany
关键词
Power demand; Nonvolatile memory; System performance; Systems architecture; Data processing; Software; Decoding; Sensors; Arrays; Optimization; Binary tree data structure; computation-in-memory (CiM); decoder; gem5; latch;
D O I
10.1109/TCAD.2024.3447216
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The computation-in nonvolatile memory (NVM-CiM) approach addresses the growing computational demands and the memory-wall problem faced by traditional processor-centric architectures. Computation-in-memory (CiM) capitalizes on the parallel nature of memory arrays enabling effective computation through multirow memristor reading and sensing. In this context, the conventional design of memory decoders needs to be accordingly modified for efficient multirow activation and parallel data processing. This article presents the design and optimization of address decoders for NVM-CiM system architectures, employing a cross-layer co-optimization approach that integrates circuit and architecture design with application requirements. Our methodology starts at the circuit level, examining various decoder designs, including cascaded, hierarchical, latched, and hybrid models. An in-depth application-level characterization follows, utilizing an extended NVM-CiM-capable gem5 simulator to assess the impact of these decoders on the mapping of CiM-friendly applications and the resulting system performance, particularly in facilitating rapid and efficient activation of multirow memory configurations. This holistic analysis allows us to identify the bottlenecks and requirements from the application side and adjust the design of the decoder accordingly. Our analysis reveals that Hybrid Decoders significantly decrease latency and power consumption compared to other decoder designs within NVM-CiM systems. This highlights the crucial role of the decoder's row selection flexibility, reducing additional system-level data movement even at the expense of its performance, can substantially improve the overall efficiency of NVM-CiM systems.
引用
收藏
页码:3744 / 3755
页数:12
相关论文
共 47 条
  • [31] AN FPGA HARDWARE/SOFTWARE CO-DESIGN TOWARDS EVOLVABLE SPIKING NEURAL NETWORKS FOR ROBOTICS APPLICATION
    Johnston, S. P.
    Prasad, G.
    Maguire, L.
    Mcginnity, T. M.
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2010, 20 (06) : 447 - 461
  • [32] Hardware/software co-design in the rapid prototyping of application-specific signal processors methodology
    Schaming, WB
    VHDL INTERNATIONAL USERS' FORUM, PROCEEDINGS, 1997, : 241 - 250
  • [33] A CONFLICT-FREE MEMORY MAPPING APPROACH TO DESIGN PARALLEL HARDWARE INTERLEAVER ARCHITECTURES WITH OPTIMIZED NETWORK AND CONTROLLER
    Briki, Aroua
    Chavet, Cyrille
    Coussy, Philippe
    2013 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2013, : 201 - 206
  • [34] MELOPPR: Software/Hardware Co-design for Memory-efficient Low-latency Personalized PageRank
    Li, Lixiang
    Chen, Yao
    Zirnheld, Zacharie
    Li, Pan
    Hao, Cong
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 601 - 606
  • [35] A Hardware-Software Co-design Approach to a JPEG Encoder Design for a Planetary Micro-Rover Application
    Sarma, S.
    Udupa, S.
    Bhardwaj, K. M.
    Parameswaran, K.
    Malik, N. K.
    INTELLIGENT ROBOTS AND COMPUTER VISION XXVIII: ALGORITHMS AND TECHNIQUES, 2011, 7878
  • [36] Software/Hardware Co-design of 3D NoC-based GPU Architectures for Accelerated Graph Computations
    Choudhury, Dwaipayan
    Barik, Reet
    Rajam, Aravind Sukumaran
    Kalyanaraman, Ananth
    Pande, Partha Pratim
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (06)
  • [37] Hardware-Software Co-Design of an Embedded Power Management Module with Adaptive On-Chip Power Processing Schemes
    Bondade, Rajdeep
    Ma, Dongsheng
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 617 - 620
  • [38] Hardware/Software Co-Design for TinyML Voice-Recognition Application on Resource Frugal Edge Devices
    Kwon, Jisu
    Park, Daejin
    APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [39] Multi-Tag: A Hardware-Software Co-Design for Memory Safety based on Multi-Granular Memory Tagging
    Unterguggenberger, Martin
    Schrammel, David
    Nasahl, Pascal
    Schilling, Robert
    Lamster, Lukas
    Mangard, Stefan
    PROCEEDINGS OF THE 2023 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ASIA CCS 2023, 2023, : 177 - 189
  • [40] A model-based embedded control hardware/software co-design approach for optimized sensor selection of industrial systems
    Deliparaschos, Kyriakos M.
    Michail, Konstantinos
    Tzafestas, Spyros G.
    Zolotas, Argyrios C.
    2015 23RD MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2015, : 889 - 894