Multi-Stage Tunable Approximate Search in Resistive Associative Memory

被引:10
|
作者
Imani, Mohsen [1 ]
Rahimi, Abbas [2 ]
Mercati, Pietro [1 ]
Rosing, Tajana Simunic [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Associative memory; approximate computing; resistive memory; GPUs; ternary content addressable memory (TCAM); non-volatile memory;
D O I
10.1109/TMSCS.2017.2665462
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
General-purpose graphics processing units (GPGPUs), as programmable accelerators, improve energy efficiency by integrating a large number of relatively small cores. In this paper, we focus on improving energy efficiency of such processing core by integrating an associative memory where function responses are prestored. Associative memories can search and recall function responses for a subset of input values therefore avoiding the actual function execution on the processing core that leads to energy saving. We propose a novel low-energy Resistive Multi-stage Associative Memory (ReMAM) architecture to significantly reduce energy of a search operation by employing selective row activation and in-advance precharging techniques. ReMAM splits the search operations in a ternary content addressable memory (TCAM) to a number of shorter searches in consecutive stages. Then, it selectively activates TCAM rows at each stage based on the hits of previous stages, thus enabling energy savings. The proposed inadvance precharging technique mitigates the delay of the sequential TCAM search and limits the number of precharges to two low-cost steps. ReMAM further implements approximation on the selective TCAM blocks to reduce the search energy that relaxes the function output in a fine-grained granularity with very low impact on accuracy of the results. Its multi-stage search operation makes ReMAM applicable to many applications such as search engines, sorting, image coding, pattern recognition, query processing, and machine learning. In this work, we show an application of proposed ReMAM on AMD Southern Island GPUs. Our experimental evaluation shows that ReMAM reduces on average GPGPU energy consumption by 35 percent in the exact mode, and 58 percent in approximate mode with average relative error lower than 10 percent. These energy savings are 1.8 x and 1.5 x higher than state-of-the-art associative memories used in GPGPUs in exact and approximate modes.
引用
收藏
页码:17 / 29
页数:13
相关论文
共 50 条
  • [1] ReMAM: Low Energy Resistive Multi-Stage Associative Memory for Energy Efficient Computing
    Imani, Mohsen
    Mercati, Pietro
    Rosing, Tajana
    PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN ISQED 2016, 2016, : 101 - 106
  • [2] Resistive Configurable Associative Memory for Approximate Computing
    Imani, Mohsen
    Rahimi, Abbas
    Rosing, Tajana S.
    PROCEEDINGS OF THE 2016 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2016, : 1327 - 1332
  • [3] A Generalization of the Multi-Stage Search Allocation Game
    Hohzaki, Ryusuke
    ADVANCES IN DYNAMIC GAMES: THEORY, APPLICATIONS, AND NUMERICAL METHODS FOR DIFFERENTIAL AND STOCHASTIC GAMES: DEDICATED TO THE MEMORY OF ARIK A. MELIKYAN, 2011, 11 : 189 - 207
  • [4] On Approximate Nearest Neighbour Selection for Multi-Stage Dense Retrieval
    Macdonald, Craig
    Tonellotto, Nicola
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3318 - 3322
  • [5] SEARCH OF ASSOCIATIVE MEMORY
    RAAIJMAKERS, JGW
    SHIFFRIN, RM
    PSYCHOLOGICAL REVIEW, 1981, 88 (02) : 93 - 134
  • [6] Multi-stage Rocket Mission Decision Based on Approximate Dynamic Programming
    Li, Chaobing
    Bao, Weimin
    Li, Zhongkui
    Yu, Chunmei
    Cheng, Xiaoming
    Yuhang Xuebao/Journal of Astronautics, 2024, 45 (08): : 1251 - 1260
  • [7] HG-search: multi-stage search for heterogeneous graph neural networks
    Sun, Hongmin
    Kan, Ao
    Liu, Jianhao
    Du, Wei
    APPLIED INTELLIGENCE, 2025, 55 (01)
  • [8] Multi-stage adaptive search vector quantization for image compression
    Refai, W
    HYBRID IMAGE AND SIGNAL PROCESSING V, 1996, 2751 : 50 - 55
  • [9] Resistive CAM Acceleration for Tunable Approximate Computing
    Imani, Mohsen
    Peroni, Daniel
    Rahimi, Abbas
    Rosing, Tajana Simunic
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2019, 7 (02) : 271 - 280
  • [10] Optimized multi-stage minimum-distance-search circuit with feedback-stabilization for fully-parallel associative memories
    Kamimura, K
    Rahman, KM
    Mattausch, HJ
    Koide, T
    2004 47TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2004, : 161 - 164