Reinforcement co-Learning of Deep and Spiking Neural Networks for Energy-Efficient Mapless Navigation with Neuromorphic Hardware

被引:35
|
作者
Tang, Guangzhi [1 ]
Kumar, Neelesh [1 ]
Michmizos, Konstantinos P. [1 ]
机构
[1] Rutgers State Univ, Computat Brain Lab, Dept Comp Sci, New Brunswick, NJ 08854 USA
关键词
D O I
10.1109/IROS45743.2020.9340948
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Energy-efficient mapless navigation is crucial for mobile robots as they explore unknown environments with limited on-board resources. Although the recent deep reinforcement learning (DRL) approaches have been successfully applied to navigation, their high energy consumption limits their use in several robotic applications. Here, we propose a neuromorphic approach that combines the energy-efficiency of spiking neural networks with the optimality of DRL and benchmark it in learning control policies for mapless navigation. Our hybrid framework, spiking deep deterministic policy gradient (SDDPG), consists of a spiking actor network (SAN) and a deep critic network, where the two networks were trained jointly using gradient descent. The co-learning enabled synergistic information exchange between the two networks, allowing them to overcome each other's limitations through a shared representation learning. To evaluate our approach, we deployed the trained SAN on Intel's Loihi neuromorphic processor. When validated on simulated and real-world complex environments, our method on Loihi consumed 75 times less energy per inference as compared to DDPG on Jetson TX2, and also exhibited a higher rate of successful navigation to the goal, which ranged from 1% to 4.2% and depended on the forward-propagation timestep size. These results reinforce our ongoing efforts to design brain-inspired algorithms for controlling autonomous robots with neuromorphic hardware.
引用
收藏
页码:6090 / 6097
页数:8
相关论文
共 50 条
  • [1] TRAINING DEEP SPIKING NEURAL NETWORKS FOR ENERGY-EFFICIENT NEUROMORPHIC COMPUTING
    Srinivasan, Gopalakrishnan
    Lee, Chankyu
    Sengupta, Abhronil
    Panda, Priyadarshini
    Sarwar, Syed Shakib
    Roy, Kaushik
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8549 - 8553
  • [2] Spiking Neural Network on Neuromorphic Hardware for Energy-Efficient Unidimensional SLAM
    Tang, Guangzhi
    Shah, Arpit
    Michmizos, Konstantinos P.
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4176 - 4181
  • [3] Benchmarking Deep Spiking Neural Networks on Neuromorphic Hardware
    Ostrau, Christoph
    Homburg, Jonas
    Klarhorst, Christian
    Thies, Michael
    Rueckert, Ulrich
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 610 - 621
  • [4] Spiking Neural Network Discovers Energy-Efficient Hexapod Motion in Deep Reinforcement Learning
    Naya, Katsumi
    Kutsuzawa, Kyo
    Owaki, Dai
    Hayashibe, Mitsuhiro
    IEEE ACCESS, 2021, 9 (09): : 150345 - 150354
  • [5] Energy-efficient Hardware Design for Spiking Neural Networks (Extended Abstract)
    Moitra, Abhishek
    Yin, Ruokai
    Panda, Priyadarshini
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 543 - 544
  • [6] An Energy-Efficient Hardware Accelerator for Hierarchical Deep Reinforcement Learning
    Shiri, Aidin
    Prakash, Bharat
    Mazumder, Arnab Neelim
    Waytowich, Nicholas R.
    Oates, Tim
    Mohsenin, Tinoosh
    2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS), 2021,
  • [7] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Yongqiang Cao
    Yang Chen
    Deepak Khosla
    International Journal of Computer Vision, 2015, 113 : 54 - 66
  • [8] Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition
    Cao, Yongqiang
    Chen, Yang
    Khosla, Deepak
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) : 54 - 66
  • [9] Fast and Energy-efficient deep Neuromorphic Learning
    Goeltz, Julian
    Kriener, Laura
    Sabado, Virginie
    Petrovici, Mihai A.
    ERCIM NEWS, 2021, (125): : 17 - 18
  • [10] In-Hardware Learning of Multilayer Spiking Neural Networks on a Neuromorphic Processor
    Shrestha, Amar
    Fang, Haowen
    Rider, Daniel Patrick
    Mei, Zaidao
    Qiu, Qinru
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 367 - 372