Robust Architecture-Agnostic and Noise Resilient Training of Photonic Deep Learning Models

被引:17
|
作者
Kirtas, Manos [1 ]
Passalis, Nikolaos [1 ]
Mourgias-Alexandris, George [2 ]
Dabos, George [2 ]
Pleros, Nikos [2 ]
Tefas, Anastasios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Deep Learning Grp, Computat Intelligence, Thessaloniki 54124, Greece
[2] Aristotle Univ Thessaloniki, Dept Informat, Wireless, Networks Grp,Photon Syst, Thessaloniki 54124, Greece
基金
欧盟地平线“2020”;
关键词
Photonics; Training; Neurons; Neuromorphics; Modulation; Adaptation models; Optical noise; Photonic deep learning; neural network initialization; constrains-aware training;
D O I
10.1109/TETCI.2022.3182765
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neuromorphic photonic accelerators for Deep Learning (DL) have increasingly gained attention over the recent years due to their ability for ultra fast matrix-based calculations and low power consumption providing a great potential for DL implementations to deal with a wide range of different applications. At the same time, physical properties of the optical components hinder their application since they introduce a number of limitations, such as easily saturated activation functions as well as the existence of various noise sources. As a result, photonic DL models are especially challenging to be trained and deployed, compared with regular DL models, since traditionally used methods do not take into account the aforementioned constraints. To overcome these limitations and motivated by the fact that the information lost in one layer cannot be easily recovered when gradient-descent based algorithms are employed, we propose a novel training method for photonic neuromorphic architectures that is capable of taking into account a wide range of limitations of the actual hardware, including noise sources and easily saturated activation mechanisms. Compared to existing works, the proposed method takes a more holistic view of the training process, focusing both on the initialization process, as well as on the actual weight updates. The effectiveness of the proposed method is demonstrated on a variety of different problems and photonic neural network (PNN) architectures, including a noisy photonic recurrent neural network evaluated on high-frequency time series forecasting and a deep photonic feed-forward setup consisting of a transmitter, noisy channel, and receiver, which is used as an intensity modulation/direct detection system (IM/DD).
引用
收藏
页码:140 / 149
页数:10
相关论文
共 50 条
  • [1] Protecting Deep Neural Network Intellectual Property with Architecture-Agnostic Input Obfuscation
    Olney, Brooks
    Karam, Robert
    PROCEEDINGS OF THE 32ND GREAT LAKES SYMPOSIUM ON VLSI 2022, GLSVLSI 2022, 2022, : 111 - 115
  • [2] Adversary Agnostic Robust Deep Reinforcement Learning
    Qu, Xinghua
    Gupta, Abhishek
    Ong, Yew-Soon
    Sun, Zhu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6146 - 6157
  • [3] Runtime vs. Manual Data Distribution for Architecture-Agnostic Shared-Memory Programming Models
    Dimitrios S. Nikolopoulos
    Eduard Ayguadé
    Constantine D. Polychronopoulos
    International Journal of Parallel Programming, 2002, 30 : 225 - 255
  • [4] Silicon integrated photonic-electronic neuron for noise-resilient deep learning
    Roumpos, Ioannis
    de Marinis, Lorenzo
    Kovaios, Stefanos
    Kincaid, Peter Seigo
    Paolini, Emilio
    Tsakyridis, Apostolos
    Moralis-Pegios, Miltiadis
    Berciano, Mathias
    Ferraro, Filippo
    Bode, Dieter
    Srinivasan, Srinivasan Ashwyn
    Pantouvaki, Marianna
    Andriolli, Nicola
    Contestabile, Giampiero
    Pleros, Nikos
    Vyrsokinos, Konstantinos
    OPTICS EXPRESS, 2024, 32 (20): : 34264 - 34274
  • [5] Runtime vs. manual data distribution for architecture-agnostic shared-memory programming models
    Nikolopoulos, DS
    Ayguadé, E
    Polychronopoulos, CD
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2002, 30 (04) : 225 - 255
  • [6] The Complexity of Adversarially Robust Proper Learning of Halfspaces with Agnostic Noise
    Diakonikolas, Ilias
    Kane, Daniel M.
    Manurangsi, Pasin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [7] Hybrid Noise-Resilient Deep Learning Architecture for Modulation Classification in Cognitive Radio Networks
    Ivanov, Antoni
    Tonchev, Krasimir
    Poulkov, Vladimir
    Al-Shatri, Hussein
    Klein, Anja
    FUTURE ACCESS ENABLERS FOR UBIQUITOUS AND INTELLIGENT INFRASTRUCTURES, FABULOUS 2019, 2019, 283 : 214 - 227
  • [8] Training Robust Deep Collaborative Filtering Models via Adversarial Noise Propagation
    Chen, Hai
    Qian, Fulan
    Liu, Chang
    Zhang, Yanping
    Su, Hang
    Zhao, Shu
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (01)
  • [9] Training Noise-Robust Deep Neural Networks via Meta-Learning
    Wang, Zhen
    Hu, Guosheng
    Hu, Qinghua
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4523 - 4532
  • [10] Ptolemy: Architecture Support for Robust Deep Learning
    Gan, Yiming
    Qiu, Yuxian
    Leng, Jingwen
    Guo, Minyi
    Zhu, Yuhao
    2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, : 241 - 255