Mixed precision quantization of silicon optical neural network chip

被引:1
|
作者
Zhang, Ye [1 ]
Wang, Ruiting [2 ,3 ]
Zhang, Yejin [2 ,3 ]
Pan, Jiaoqing [2 ,3 ]
机构
[1] Beijing Informat Sci & Technol Univ, Beijing 100192, Peoples R China
[2] Chinese Acad Sci, Key Lab Semicond Mat Sci, Inst Semicond, Beijing 100083, Peoples R China
[3] Univ Chinese Acad Sci, Ctr Mat Sci & Optoelect Engn, Beijing 100049, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
GENETIC ALGORITHM; END;
D O I
10.1016/j.optcom.2024.131231
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In recent years, the field of neural network research has witnessed remarkable advancements in various domains. One of the emerging approaches is the integration of photonic computing, which leverages the unique properties of light for ultra-fast information processing. In this article, we establish a mixed precision quantization model to silicon-based optical neural networks and evaluates their performance on the MNIST and Fashion-MNIST datasets. Through a genetic algorithm- based optimization process, we achieve significant parameter compression while maintaining competitive accuracy. Our findings demonstrate that with an average quantization bitwidth of 4.5 bits on the MNIST dataset, we achieve an impressive 85.94% reduction in parameter size compared to traditional 32-bit networks, with only a marginal accuracy drop of 0.65%. Similarly, on the Fashion-MNIST dataset, we achieve an average quantization bitwidth of 5.67 bits, resulting in an 82.28% reduction in parameter size with a slight accuracy drop of 0.8%.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Bias and precision of neural network and deterministic approaches for predicting pulpwood chip distributions
    Schultz, EB
    Matney, TG
    PROCEEDINGS OF THE 1996 SOCIETY OF AMERICAN FORESTERS CONVENTION: DIVERSE FORESTS, ABUNDANT OPPORTUNITIES, AND EVOLVING REALITIES, 1996, : 73 - 73
  • [32] A mixed-signal VLSI neural network with on-chip learning
    Mirhassani, M
    Ahmadi, M
    Miller, WC
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 591 - 594
  • [33] A Dynamic Execution Neural Network Processor for Fine-Grained Mixed-Precision Model Training Based on Online Quantization Sensitivity Analysis
    Liu, Ruoyang
    Wei, Chenhan
    Yang, Yixiong
    Wang, Wenxun
    Yuan, Binbin
    Yang, Huazhong
    Liu, Yongpan
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024, 59 (09) : 3082 - 3093
  • [34] Joint Pruning and Channel-Wise Mixed-Precision Quantization for Efficient Deep Neural Networks
    Motetti, Beatrice Alessandra
    Risso, Matteo
    Burrello, Alessio
    Macii, Enrico
    Poncino, Massimo
    Pagliari, Daniele Jahier
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (11) : 2619 - 2633
  • [35] Training of Mixed-Signal Optical Convolutional Neural Networks With Reduced Quantization Levels
    Zhu, Zheyuan
    Ulseth, Joseph
    Li, Guifang
    Pang, Shuo
    IEEE ACCESS, 2021, 9 : 56645 - 56652
  • [36] Automatic Mixed-Precision Quantization Search of BERT
    Zhao, Changsheng
    Hua, Ting
    Shen, Yilin
    Lou, Qian
    Jin, Hongxia
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3427 - 3433
  • [37] Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution
    Zhang Zhaoyang
    Shao Wenqi
    Gu Jinwei
    Wang Xiaogang
    Luo Ping
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [38] A Novel Mixed-Precision Quantization Approach for CNNs
    Wu, Dan
    Wang, Yanzhi
    Fei, Yuqi
    Gao, Guowang
    IEEE ACCESS, 2025, 13 : 49309 - 49319
  • [39] Chip-Based High-Dimensional Optical Neural Network
    Xinyu Wang
    Peng Xie
    Bohan Chen
    Xingcai Zhang
    Nano-Micro Letters, 2022, 14 (12) : 578 - 586
  • [40] Chip-Based High-Dimensional Optical Neural Network
    Wang, Xinyu
    Xie, Peng
    Chen, Bohan
    Zhang, Xingcai
    NANO-MICRO LETTERS, 2022, 14 (01)