Reinforcement co-Learning of Deep and Spiking Neural Networks for Energy-Efficient Mapless Navigation with Neuromorphic Hardware

被引:35
|
作者
Tang, Guangzhi [1 ]
Kumar, Neelesh [1 ]
Michmizos, Konstantinos P. [1 ]
机构
[1] Rutgers State Univ, Computat Brain Lab, Dept Comp Sci, New Brunswick, NJ 08854 USA
来源
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2020年
关键词
D O I
10.1109/IROS45743.2020.9340948
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Energy-efficient mapless navigation is crucial for mobile robots as they explore unknown environments with limited on-board resources. Although the recent deep reinforcement learning (DRL) approaches have been successfully applied to navigation, their high energy consumption limits their use in several robotic applications. Here, we propose a neuromorphic approach that combines the energy-efficiency of spiking neural networks with the optimality of DRL and benchmark it in learning control policies for mapless navigation. Our hybrid framework, spiking deep deterministic policy gradient (SDDPG), consists of a spiking actor network (SAN) and a deep critic network, where the two networks were trained jointly using gradient descent. The co-learning enabled synergistic information exchange between the two networks, allowing them to overcome each other's limitations through a shared representation learning. To evaluate our approach, we deployed the trained SAN on Intel's Loihi neuromorphic processor. When validated on simulated and real-world complex environments, our method on Loihi consumed 75 times less energy per inference as compared to DDPG on Jetson TX2, and also exhibited a higher rate of successful navigation to the goal, which ranged from 1% to 4.2% and depended on the forward-propagation timestep size. These results reinforce our ongoing efforts to design brain-inspired algorithms for controlling autonomous robots with neuromorphic hardware.
引用
收藏
页码:6090 / 6097
页数:8
相关论文
共 50 条
  • [31] An Efficient Event-driven Neuromorphic Architecture for Deep Spiking Neural Networks
    Duy-Anh Nguyen
    Duy-Hicu Bui
    Iacopi, Francesca
    Xuan-Tu Tran
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 144 - 149
  • [32] EnsembleSNN: Distributed Assistive STDP Learning for Energy-Efficient Recognition in Spiking Neural Networks
    Panda, Priyadarshini
    Srinivasan, Gopalakrishnan
    Roy, Kaushik
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2629 - 2635
  • [33] An Efficient Deep Reinforcement Learning Algorithm for Mapless Navigation with Gap-Guided Switching Strategy
    Li, Heng
    Qin, Jiahu
    Liu, Qingchen
    Yan, Chengzhen
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2023, 108 (03)
  • [34] Dynamic Spike Bundling for Energy-Efficient Spiking Neural Networks
    Krithivasan, Sarada
    Sen, Sanchari
    Venkataramani, Swagath
    Raghunathan, Anand
    2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [35] Towards Energy-Efficient Sentiment Classification with Spiking Neural Networks
    Chen, Junhao
    Ye, Xiaojun
    Sun, Jingbo
    Li, Chao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 518 - 529
  • [36] Deep Reinforcement Learning for Energy-Efficient Edge Caching in Mobile Edge Networks
    Deng, Meng
    Huan, Zhou
    Kai, Jiang
    Zheng, Hantong
    Yue, Cao
    Peng, Chen
    CHINA COMMUNICATIONS, 2024, : 1 - 14
  • [37] Deep Reinforcement Learning for Energy-Efficient Data Dissemination Through UAV Networks
    Ali, Abubakar S.
    Al-Habob, Ahmed A.
    Naser, Shimaa
    Bariah, Lina
    Dobre, Octavia A.
    Muhaidat, Sami
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 : 5567 - 5583
  • [38] Deep Reinforcement Learning for Energy-Efficient Edge Caching in Mobile Edge Networks
    Meng Deng
    Zhou Huan
    Jiang Kai
    Zheng Hantong
    Cao Yue
    Chen Peng
    China Communications, 2024, 21 (11) : 243 - 256
  • [39] Efficient Hardware Implementation for Online Local Learning in Spiking Neural Networks
    Guo, Wenzhe
    Fouda, Mohammed E.
    Eltawil, Ahmed M.
    Salama, Khaled Nabil
    2022 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2022): INTELLIGENT TECHNOLOGY IN THE POST-PANDEMIC ERA, 2022, : 387 - 390
  • [40] E2HRL: An Energy-efficient Hardware Accelerator for Hierarchical Deep Reinforcement Learning
    Shiri, Aidin
    Kallakuri, Uttej
    Rashid, Hasib-Al
    Prakash, Bharat
    Waytowich, Nicholas R.
    Oates, Tim
    Mohsenin, Tinoosh
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (05)