Robust Architecture-Agnostic and Noise Resilient Training of Photonic Deep Learning Models

Cited by: 17
Authors
Kirtas, Manos [1]
Passalis, Nikolaos [1]
Mourgias-Alexandris, George [2]
Dabos, George [2]
Pleros, Nikos [2]
Tefas, Anastasios [1]
Affiliations
[1] Aristotle Univ Thessaloniki, Dept Informat, Deep Learning Grp, Computat Intelligence, Thessaloniki 54124, Greece
[2] Aristotle Univ Thessaloniki, Dept Informat, Wireless, Networks Grp, Photon Syst, Thessaloniki 54124, Greece
Funding
EU Horizon 2020;
Keywords
Photonics; Training; Neurons; Neuromorphics; Modulation; Adaptation models; Optical noise; Photonic deep learning; neural network initialization; constrains-aware training;
DOI
10.1109/TETCI.2022.3182765
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neuromorphic photonic accelerators for Deep Learning (DL) have gained increasing attention in recent years because they offer ultra-fast matrix-based calculations at low power consumption, giving them great potential for DL implementations across a wide range of applications. At the same time, the physical properties of the optical components hinder their application, since they introduce a number of limitations, such as easily saturated activation functions and various noise sources. As a result, photonic DL models are especially challenging to train and deploy compared with regular DL models, since traditionally used methods do not take these constraints into account. To overcome these limitations, and motivated by the fact that information lost in one layer cannot be easily recovered when gradient-descent-based algorithms are employed, we propose a novel training method for photonic neuromorphic architectures that can take into account a wide range of limitations of the actual hardware, including noise sources and easily saturated activation mechanisms. Compared to existing works, the proposed method takes a more holistic view of the training process, focusing both on the initialization process and on the actual weight updates. The effectiveness of the proposed method is demonstrated on a variety of problems and photonic neural network (PNN) architectures, including a noisy photonic recurrent neural network evaluated on high-frequency time series forecasting and a deep photonic feed-forward setup consisting of a transmitter, noisy channel, and receiver, which is used as an intensity modulation/direct detection (IM/DD) system.
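The two hardware constraints named in the abstract, noise sources and easily saturated activations, can be illustrated with a small sketch. This is not the authors' method: the clipping activation, the additive Gaussian noise model, the layer sizes, and every function name below are assumptions chosen only to show why the initialization scale matters when an activation saturates easily.

```python
import random


def saturating_activation(x, lo=-1.0, hi=1.0):
    """Hypothetical stand-in for an easily saturated activation: clip to [lo, hi]."""
    return max(lo, min(hi, x))


def init_weights(n_out, n_in, scale, rng):
    """Zero-mean Gaussian initialization with standard deviation `scale` (assumed)."""
    return [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]


def noisy_layer(inputs, weights, noise_std, rng):
    """Weighted sum, then additive Gaussian noise (assumed noise model), then clipping."""
    outputs = []
    for row in weights:
        z = sum(w * x for w, x in zip(row, inputs))
        z += rng.gauss(0.0, noise_std)
        outputs.append(saturating_activation(z))
    return outputs


def saturated_fraction(outputs, lo=-1.0, hi=1.0):
    """Fraction of outputs sitting at a clipping bound, where the gradient vanishes."""
    return sum(1 for y in outputs if y <= lo or y >= hi) / len(outputs)


rng = random.Random(0)
inputs = [rng.random() for _ in range(64)]

# Same layer shape and noise level, two different initialization scales.
wide = init_weights(256, 64, 1.0, rng)
narrow = init_weights(256, 64, 0.05, rng)

frac_wide = saturated_fraction(noisy_layer(inputs, wide, 0.05, rng))
frac_narrow = saturated_fraction(noisy_layer(inputs, narrow, 0.05, rng))
print("saturated fraction, wide init:  ", frac_wide)
print("saturated fraction, narrow init:", frac_narrow)
```

Under these assumptions, the wider initialization pushes far more neurons to a clipping bound than the narrower one, which is one way to picture the information loss in a layer that the abstract refers to.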
Pages: 140-149
Number of pages: 10