Multi-Stage Tunable Approximate Search in Resistive Associative Memory

被引:10
|
作者
Imani, Mohsen [1 ]
Rahimi, Abbas [2 ]
Mercati, Pietro [1 ]
Rosing, Tajana Simunic [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Associative memory; approximate computing; resistive memory; GPUs; ternary content addressable memory (TCAM); non-volatile memory;
D O I
10.1109/TMSCS.2017.2665462
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
General-purpose graphics processing units (GPGPUs), as programmable accelerators, improve energy efficiency by integrating a large number of relatively small cores. In this paper, we focus on improving energy efficiency of such processing core by integrating an associative memory where function responses are prestored. Associative memories can search and recall function responses for a subset of input values therefore avoiding the actual function execution on the processing core that leads to energy saving. We propose a novel low-energy Resistive Multi-stage Associative Memory (ReMAM) architecture to significantly reduce energy of a search operation by employing selective row activation and in-advance precharging techniques. ReMAM splits the search operations in a ternary content addressable memory (TCAM) to a number of shorter searches in consecutive stages. Then, it selectively activates TCAM rows at each stage based on the hits of previous stages, thus enabling energy savings. The proposed inadvance precharging technique mitigates the delay of the sequential TCAM search and limits the number of precharges to two low-cost steps. ReMAM further implements approximation on the selective TCAM blocks to reduce the search energy that relaxes the function output in a fine-grained granularity with very low impact on accuracy of the results. Its multi-stage search operation makes ReMAM applicable to many applications such as search engines, sorting, image coding, pattern recognition, query processing, and machine learning. In this work, we show an application of proposed ReMAM on AMD Southern Island GPUs. Our experimental evaluation shows that ReMAM reduces on average GPGPU energy consumption by 35 percent in the exact mode, and 58 percent in approximate mode with average relative error lower than 10 percent. These energy savings are 1.8 x and 1.5 x higher than state-of-the-art associative memories used in GPGPUs in exact and approximate modes.
引用
收藏
页码:17 / 29
页数:13
相关论文
共 50 条
  • [21] ASSOCIATIVE SEARCH NETWORK - A REINFORCEMENT LEARNING ASSOCIATIVE MEMORY
    BARTO, AG
    SUTTON, RS
    BROUWER, PS
    BIOLOGICAL CYBERNETICS, 1981, 40 (03) : 201 - 211
  • [22] Comment on "Unbiased Statistical Analysis for Multi-Stage Proteomic Search Strategies"
    Bern, Marshall
    Kil, Yong J.
    JOURNAL OF PROTEOME RESEARCH, 2011, 10 (04) : 2123 - 2127
  • [23] Adaptation of Decoy Fusion Strategy for Existing Multi-Stage Search Workflows
    Gorshkov, Mikhail V. (mike.gorshkov@gmail.com), 1600, Springer Science and Business Media, LLC (27):
  • [24] Adaptation of Decoy Fusion Strategy for Existing Multi-Stage Search Workflows
    Ivanov, Mark V.
    Levitsky, Lev I.
    Gorshkov, Mikhail V.
    JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, 2016, 27 (09) : 1579 - 1582
  • [25] Fast codeword search for vector quantization using a multi-stage approach
    Woo, HC
    Barnwell, TP
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2629 - 2632
  • [26] Associative Memristive Memory for Approximate Computing in GPUs
    Ghofrani, Amirali
    Rahimi, Abbas
    Lastras-Montano, Miguel A.
    Benini, Luca
    Gupta, Rajesh K.
    Cheng, Kwang-Ting
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2016, 6 (02) : 222 - 234
  • [27] Multi-stage classification
    Senator, TE
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 386 - 393
  • [28] Multi-Stage Mach–Zehnder Based Continuously Tunable Photonic Delay Line
    Zarlish Mushtaq
    Muhammad Aslam Uqaili
    Abi Waqas
    Bhawani Shankar Chowdhry
    Wireless Personal Communications, 2021, 121 : 1221 - 1231
  • [29] Multi-stage programming
    Taha, W
    Sheard, T
    ACM SIGPLAN NOTICES, 1997, 32 (08) : 321 - 321
  • [30] The multi-stage railgun
    Musolino, A
    Raugi, M
    Rocco, R
    Tellini, A
    IEEE TRANSACTIONS ON MAGNETICS, 2001, 37 (01) : 445 - 449