Multi-Stage Tunable Approximate Search in Resistive Associative Memory

被引:10
|
作者
Imani, Mohsen [1 ]
Rahimi, Abbas [2 ]
Mercati, Pietro [1 ]
Rosing, Tajana Simunic [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Associative memory; approximate computing; resistive memory; GPUs; ternary content addressable memory (TCAM); non-volatile memory;
D O I
10.1109/TMSCS.2017.2665462
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
General-purpose graphics processing units (GPGPUs), as programmable accelerators, improve energy efficiency by integrating a large number of relatively small cores. In this paper, we focus on improving energy efficiency of such processing core by integrating an associative memory where function responses are prestored. Associative memories can search and recall function responses for a subset of input values therefore avoiding the actual function execution on the processing core that leads to energy saving. We propose a novel low-energy Resistive Multi-stage Associative Memory (ReMAM) architecture to significantly reduce energy of a search operation by employing selective row activation and in-advance precharging techniques. ReMAM splits the search operations in a ternary content addressable memory (TCAM) to a number of shorter searches in consecutive stages. Then, it selectively activates TCAM rows at each stage based on the hits of previous stages, thus enabling energy savings. The proposed inadvance precharging technique mitigates the delay of the sequential TCAM search and limits the number of precharges to two low-cost steps. ReMAM further implements approximation on the selective TCAM blocks to reduce the search energy that relaxes the function output in a fine-grained granularity with very low impact on accuracy of the results. Its multi-stage search operation makes ReMAM applicable to many applications such as search engines, sorting, image coding, pattern recognition, query processing, and machine learning. In this work, we show an application of proposed ReMAM on AMD Southern Island GPUs. Our experimental evaluation shows that ReMAM reduces on average GPGPU energy consumption by 35 percent in the exact mode, and 58 percent in approximate mode with average relative error lower than 10 percent. These energy savings are 1.8 x and 1.5 x higher than state-of-the-art associative memories used in GPGPUs in exact and approximate modes.
引用
收藏
页码:17 / 29
页数:13
相关论文
共 50 条
  • [31] Search Your Memory ! - An Associative Memory Based Desktop Search System
    Chen, Jidong
    Guo, Hang
    Wu, Wentao
    Xie, Chunxin
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 1099 - 1101
  • [32] A Multi-Stage Memory Augmented Neural Network for Machine Reading Comprehension
    Yu, Seunghak
    Indurthi, Sathish
    Back, Seohyun
    Lee, Haejun
    MACHINE READING FOR QUESTION ANSWERING, 2018, : 21 - 30
  • [33] Multi-Stage Memory Efficient Strassen's Matrix Multiplication on GPU
    Krishnan, Arjun Gopala
    Goswami, Dhrubajyoti
    2021 IEEE 28TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, DATA, AND ANALYTICS (HIPC 2021), 2021, : 212 - 221
  • [34] Modeling of vague causal relation in a societal system by multi-foldld multi-stage approximate reasoning
    Maeda, H
    Nobsada, Y
    LARGE SCALE SYSTEMS: THEORY AND APPLICATIONS 1998 (LSS'98), VOL 1, 1999, : 319 - 324
  • [35] Approximate Analysis of Decentralized, Multi-Stage, Pull-Type Production/Inventory Systems
    Cigdem Z. Gurgur
    Tayfur Altiok
    Annals of Operations Research, 2004, 125 : 95 - 116
  • [36] Approximate analysis of decentralized, multi-stage, pull-type production/inventory systems
    Gurgur, CZ
    Altiok, T
    ANNALS OF OPERATIONS RESEARCH, 2004, 125 (1-4) : 95 - 116
  • [37] Associative Memories to Accelerate Approximate Nearest Neighbor Search
    Gripon, Vincent
    Loewe, Matthias
    Vermet, Franck
    APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [38] Learning Multi-Stage Multi-Grained Semantic Embeddings for E-Commerce Search
    Wang, Binbin
    Li, Mingming
    Zeng, Zhixiong
    Zhuo, Jingwei
    Wang, Songlin
    Xu, Sulong
    Long, Bo
    Yan, Weipeng
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 411 - 415
  • [39] A resistive DAC for a multi-stage sigma-delta modulator DAC with dynamic element matching
    Irfansyah, Astria Nur
    Lehmann, Torsten
    Jenkins, Julian
    Tong, Tianle
    Hamilton, Tara Julia
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 98 (01) : 109 - 123
  • [40] Improved Differentiable Architecture Search With Multi-Stage Progressive Partial Channel Connections
    Xue, Yu
    Lu, Changchang
    Neri, Ferrante
    Qin, Jiafeng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 32 - 43