Optimization and inverse design of optical activation functions based on neural networks

被引:0
|
作者
Jia, Tao [1 ]
Jiang, Rui [1 ]
Fu, Ziling [1 ]
Xie, Zican [1 ]
Ding, Xin [1 ]
Wang, Zhi [1 ]
机构
[1] Beijing Jiaotong Univ, Sch Phys Sci & Engn, Inst Opt Informat, Key Lab Luminescence & Opt Informat,Minist Educ, Beijing 100044, Peoples R China
基金
国家重点研发计划;
关键词
Optical neural network; Optical nonlinear activation function; Mach-zehnder interferometer; Micro-ring resonator; PHOTONICS;
D O I
10.1016/j.optcom.2024.131370
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
The development of all-optical and electro-optical neural networks represents a rapidly growing field of research, with nonlinear activation functions serving as essential components of these systems. In this study, we employ an artificial neural network model to optimize the performance parameters of two systems based on Mach-Zehnder interferometers and micro-ring resonators. The results demonstrate that the optimized devices can accurately approximate several of the 14 activation functions (with a minimum root mean square error (RMSE) value of-33.1 dB), including Clipped ReLU, Sine, and Exponential. The optimized functions are also applied to an image recognition task using the Modified National Institute of Standards and Technology (MNIST) database, achieving maximum training and validation accuracies of 99.9% and 99.3% in simulation, respectively. Additionally, we introduce an inverse model to design the structural parameters of the coupling regions. Our approach significantly reduces the design time of the MZI-MRR activation function structure and theoretically demonstrates its feasibility and flexibility, providing a valuable example for the broader application of inverse design and optimization methods in optical neural network chips.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Reliability optimization design of ICE parts based on neural networks
    Chen, Xiaowei
    Zhu, Meilin
    Li, Bing
    Ke, Pengbo
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 1998, 26 (05): : 63 - 65
  • [42] Bicomplex Neural Networks with Hypergeometric Activation Functions
    Vieira, Nelson
    ADVANCES IN APPLIED CLIFFORD ALGEBRAS, 2023, 33 (02)
  • [43] Multistability of Neural Networks with a Class of Activation Functions
    Wang, Lili
    Lu, Wenlian
    Chen, Tianping
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 1, PROCEEDINGS, 2009, 5551 : 323 - 332
  • [44] Construction of Activation Functions for Wavelet Neural Networks
    Stepanov, Andrey B.
    PROCEEDINGS OF 2017 XX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2017, : 397 - 399
  • [45] Comparative analysis of activation functions in neural networks
    Kamalov, Firuz
    Nazir, Amril
    Safaraliev, Murodbek
    Cherukuri, Aswani Kumar
    Zgheib, Rita
    2021 28TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (IEEE ICECS 2021), 2021,
  • [46] MEDIAN ACTIVATION FUNCTIONS FOR GRAPH NEURAL NETWORKS
    Ruiz, Luana
    Gama, Fernando
    Marques, Antonio G.
    Ribeiro, Alejandro
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7440 - 7444
  • [47] Models of neural networks with fuzzy activation functions
    Nguyen, A. T.
    Korikov, A. M.
    INTERNATIONAL CONFERENCE ON MECHANICAL ENGINEERING, AUTOMATION AND CONTROL SYSTEMS 2016, 2017, 177
  • [48] Bicomplex Neural Networks with Hypergeometric Activation Functions
    Nelson Vieira
    Advances in Applied Clifford Algebras, 2023, 33
  • [49] Adaptive activation functions in convolutional neural networks
    Qian, Sheng
    Liu, Hua
    Liu, Cheng
    Wu, Si
    Wong, Hau San
    NEUROCOMPUTING, 2018, 272 : 204 - 212
  • [50] Exponential Stability of Stochastic Delayed Neural Networks with Inverse Holder Activation Functions and Markovian Jump Parameters
    Li, Yingwei
    Wu, Huaiqin
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2014, 2014