NearUni: Near-Unitary Training for Efficient Optical Neural Networks

Cited by: 1
Authors
Eldebiky, Amro [1]
Li, Bing [1]
Zhang, Grace Li [2]
Affiliations
[1] Tech Univ Munich, Munich, Germany
[2] Tech Univ Darmstadt, Darmstadt, Germany
Keywords
COMPACT;
DOI
10.1109/ICCAD57390.2023.10323877
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Optical neural networks with Mach-Zehnder interferometers (MZIs) have demonstrated advantages over their electronic counterparts in computing efficiency and power consumption. However, implementing the computation with a weight matrix in deep neural networks (DNNs) using this technique requires the decomposition of the weight matrix into two unitary matrices, because an optical network can only realize a single unitary matrix due to its structural property. Accordingly, a direct implementation of DNNs on optical networks suffers from low area efficiency. To address this challenge, this paper proposes a near-unitary training framework. In this framework, a weight matrix in a DNN is first partitioned into square submatrices to reduce the number of MZIs in the optical networks. Afterwards, training is adjusted to make the partitioned submatrices as close to unitary as possible. Each such near-unitary submatrix is then represented as the sum of a unitary matrix and a sparse matrix, where the sparse matrix implements the difference between the unitary matrix and the near-unitary matrix after training. In this way, only one optical network is needed to implement the unitary matrix, and the low computation load of the sparse matrix can be implemented with area-efficient microring resonators (MRRs). Experimental results show that the area footprint can be reduced by 81.81%, 85.51%, and 48.6% for ResNet34, VGG16, and fully connected neural networks, respectively, while the inference accuracy is maintained on the CIFAR100 and MNIST datasets.
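The three steps named in the abstract (partitioning into square submatrices, measuring closeness to unitary, and splitting a near-unitary block into a unitary part plus a sparse part) can be illustrated with a minimal NumPy sketch. Nothing here is taken from the paper's implementation: the function names, the block size `k`, the threshold `tol`, the Frobenius-norm penalty, and the use of the SVD-based polar factor as the unitary part are all assumptions made for illustration.

```python
import numpy as np

def partition_square(W, k):
    """Split W into a grid of k x k blocks (assumes both dims divide by k)."""
    rows, cols = W.shape
    if rows % k or cols % k:
        raise ValueError("matrix dimensions must be multiples of k")
    return [[W[i:i + k, j:j + k] for j in range(0, cols, k)]
            for i in range(0, rows, k)]

def unitarity_penalty(B):
    """Assumed regularizer: squared Frobenius norm of B^H B - I."""
    k = B.shape[1]
    return float(np.linalg.norm(B.conj().T @ B - np.eye(k), "fro") ** 2)

def nearest_unitary(B):
    """Unitary factor of the polar decomposition, computed via the SVD.
    This is the closest unitary matrix to B in Frobenius norm."""
    U, _, Vh = np.linalg.svd(B)
    return U @ Vh

def split_unitary_sparse(B, tol=1e-3):
    """Write B ~= Q + S with Q unitary and S the thresholded residual.
    Residual entries with magnitude <= tol are dropped (assumed rule)."""
    Q = nearest_unitary(B)
    R = B - Q
    S = np.where(np.abs(R) > tol, R, 0.0)
    return Q, S

# Hypothetical demo: an orthogonal 4 x 4 block with one perturbed entry.
rng = np.random.default_rng(0)
Q0, _ = np.linalg.qr(rng.standard_normal((4, 4)))
B = Q0.copy()
B[0, 1] += 0.05
tol = 1e-2
Q, S = split_unitary_sparse(B, tol=tol)

# Partitioning demo on a 4 x 6 matrix with 2 x 2 blocks.
W = rng.standard_normal((4, 6))
blocks = partition_square(W, 2)
```

In this sketch, `Q` would map to the single MZI mesh and the nonzero entries of `S` to the MRR part described in the abstract; the reconstruction error of `Q + S` is bounded entrywise by `tol`, so the threshold trades sparsity against accuracy.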
Pages: 8