Training of quantized deep neural networks using a magnetic tunnel junction-based synapse

被引:2
|
作者
Greenberg-Toledo, Tzofnat [1 ]
Perach, Ben [1 ]
Hubara, Itay [1 ,2 ]
Soudry, Daniel [1 ]
Kvatinsky, Shahar [1 ]
机构
[1] Technion Israel Inst Technol, Andrew & Erna Viterbi Fac Elect Engn, IL-3200003 Haifa, Israel
[2] Habana Labs Intel Co, Intel Co, Tel Aviv, Israel
基金
欧洲研究理事会;
关键词
magnetic tunnel junction; memristor; deep neural networks; quantized neural networks; MEMORY;
D O I
10.1088/1361-6641/ac251b
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Quantized neural networks (QNNs) are being actively researched as a solution for the computational complexity and memory intensity of deep neural networks. This has sparked efforts to develop algorithms that support both inference and training with quantized weight and activation values, without sacrificing accuracy. A recent example is the GXNOR framework for stochastic training of ternary and binary neural networks (TNNs and BNNs, respectively). In this paper, we show how magnetic tunnel junction (MTJ) devices can be used to support QNN training. We introduce a novel hardware synapse circuit that uses the MTJ stochastic behaviour to support the quantize update. The proposed circuit enables processing near memory (PNM) of QNN training, which subsequently reduces data movement. We simulated MTJ-based stochastic training of a TNN over the MNIST, SVHN, and CIFAR10 datasets and achieved an accuracy of 98.61% , 93.99% 83.02% , respectively (less than 1% degradation compared to the GXNOR algorithm). We evaluated the synapse array performance potential and showed that the proposed synapse circuit can train TNNs in situ, with 18.3TOPs/W 3TOPs/W for weight update.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Spin Orbit Torque-Assisted Magnetic Tunnel Junction-Based Hardware Trojan
    Kumar, Rajat
    Divyanshu, Divyanshu
    Khan, Danial
    Amara, Selma
    Massoud, Yehia
    ELECTRONICS, 2022, 11 (11)
  • [22] Magnetic Tunnel Junction-Based Spintronic Logic Units Operated by Spin Transfer Torque
    Yao, Xiaofeng
    Harms, Jonathan
    Lyle, Andrew
    Ebrahimi, Farbod
    Zhang, Yisong
    Wang, Jian-Ping
    IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2012, 11 (01) : 120 - 126
  • [23] Nanoscale Tantalum layer impacting magnetic properties of tunnel junction-based molecular devices
    Tyagi, Pawan
    Goulet, Tobias
    MRS COMMUNICATIONS, 2018, 8 (03) : 1024 - 1028
  • [24] Experimental demonstration of magnetic tunnel junction-based computational random-access memory
    Yang Lv
    Brandon R. Zink
    Robert P. Bloom
    Hüsrev Cılasun
    Pravin Khanal
    Salonik Resch
    Zamshed Chowdhury
    Ali Habiboglu
    Weigang Wang
    Sachin S. Sapatnekar
    Ulya Karpuzcu
    Jian-Ping Wang
    npj Unconventional Computing, 1 (1):
  • [25] Design of magnetic tunnel junction-based tunable spin torque oscillator at nanoscale regime
    Dwivedi, Amit Krishna
    Islam, Aminul
    IET CIRCUITS DEVICES & SYSTEMS, 2016, 10 (02) : 121 - 129
  • [26] Impact of Spin Fluctuation on the magnetic properties of Magnetic Tunnel Junction-Based Molecular Spintronic Device (MTJMSD)
    Savadkoohi, Marzieh
    Dahal, Bishnu R.
    Mutungo, Eva
    Grizzle, Andrew
    D'Angelo, Christopher
    Tyagi, Pawan
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON NANOTECHNOLOGY (IEEE NANO 2021), 2021, : 216 - 216
  • [27] Advantages of Prefabricated Tunnel Junction-Based Molecular Spintronics Devices
    Tyagi, Pawan
    Friebe, Edward
    Baker, Collin
    NANO, 2015, 10 (04)
  • [28] BinaryRelax: A Relaxation Approach for Training Deep Neural Networks with Quantized Weights
    Yin, Penghang
    Zhang, Shuai
    Lyu, Jiancheng
    Osher, Stanley
    Qi, Yingyong
    Xin, Jack
    SIAM JOURNAL ON IMAGING SCIENCES, 2018, 11 (04): : 2205 - 2223
  • [29] Fabrication of tunnel junction-based molecular electronics and spintronics devices
    Pawan Tyagi
    Journal of Nanoparticle Research, 2012, 14
  • [30] Fabrication of tunnel junction-based molecular electronics and spintronics devices
    Tyagi, Pawan
    JOURNAL OF NANOPARTICLE RESEARCH, 2012, 14 (10)