Transformer in optronic neural networks for image classification

被引:2
|
作者
Xu, Chen [1 ]
Sui, Xiubao [1 ]
Liu, Jia [1 ]
Fei, Yuhang [1 ]
Wang, Liping [1 ]
Chen, Qian [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Optical neural network; Fourier optics; Self-attention; Optical convolution;
D O I
10.1016/j.optlastec.2023.109627
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Transformer, with its self-attention mechanism, can acquire global features while parallelizing training. This study proposes an Optronic Vision Transformer (OPViT), which utilizes spatial light modulators (SLMs) and lens group to realize matrix multiplication. This is the first time that Transformer has been introduced into optical neural network, fully exploiting the advantages of photonic computing to minimize its computational cost. Based on the advantages of OPViT, it is reconfigurable, scalable, and has low space complexity. We achieve excellent classification results directly with only one layer of Transformer and two or three layers of optical convolution, achieving 98.70% and 88.93% test accuracy on the MNIST and Fashion-MNIST datasets, respectively. The classification results outperform existing optical convolutional neural networks, and are even comparable to previous optical architectures connected to electronic networks.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Combining Spiking Neural Networks with Artificial Neural Networks for Enhanced Image Classification
    Muramatsu, Naoya
    Yu, Hai-Tao
    Satoh, Tetsuji
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (02) : 252 - 261
  • [32] Adaptive Image Anonymization in the Context of Image Classification with Neural Networks
    Shvai, Nadiya
    Carmona, Arcadi Llanza
    Nakib, Amir
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5051 - 5060
  • [33] Understanding How Image Quality Affects Transformer Neural Networks
    Varga, Domonkos
    SIGNALS, 2024, 5 (03): : 562 - 579
  • [34] Enhanced Image-Based Malware Classification Using Transformer-Based Convolutional Neural Networks (CNNs)
    Ashawa, Moses
    Owoh, Nsikak
    Hosseinzadeh, Salaheddin
    Osamor, Jude
    ELECTRONICS, 2024, 13 (20)
  • [35] Hyperspectral Image Classification Based on Transposed Convolutional Neural Network Transformer
    Liu, Baisen
    Jia, Zongting
    Guo, Penggang
    Kong, Weili
    ELECTRONICS, 2023, 12 (18)
  • [36] Modular Graph Transformer Networks for Multi-Label Image Classification
    Nguyen, Hoang D.
    Vu, Xuan-Son
    Le, Duc-Trong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9092 - 9100
  • [37] Transformer-Based Spiking Neural Networks for Multimodal Audiovisual Classification
    Guo, Lingyue
    Gao, Zeyu
    Qu, Jinye
    Zheng, Suiwu
    Jiang, Runhao
    Lu, Yanfeng
    Qiao, Hong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 1077 - 1086
  • [38] Image classification by neural networks for the quality control of watches
    Moreira, M
    Fiesler, E
    Pante, G
    PROCEEDINGS ISAI/IFIS 1996 - MEXICO - USA COLLABORATION IN INTELLIGENT SYSTEMS TECHNOLOGIES, 1996, : 141 - 149
  • [39] Representation of Imprecision in Deep Neural Networks for Image Classification
    Zhang, Zuowei
    Liu, Zhunga
    Ning, Liangbo
    Martin, Arnaud
    Xiong, Jiexuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1199 - 1212
  • [40] PreNet: Parallel Recurrent Neural Networks for Image Classification
    Wang, Junbo
    Wang, Wei
    Wang, Liang
    Tan, Tieniu
    COMPUTER VISION, PT II, 2017, 772 : 461 - 473