An analytic formulation of convolutional neural network learning for pattern recognition

Cited by: 2
Authors
Zhuang, Huiping [1]
Lin, Zhiping [2]
Yang, Yimin [3]
Toh, Kar-Ann [4]
Affiliations
[1] South China Univ Technol, Shien Ming Wu Sch Intelligent Engn, Guangzhou 510460, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Western Univ, Dept Elect & Comp Engn, London, ON, Canada
[4] Yonsei Univ, Sch Elect & Elect Engn, Seoul 03722, South Korea
Funding
National Natural Science Foundation of China; National Research Foundation, Singapore;
Keywords
Pattern classification; Neural network learning; Analytic learning; Convolutional neural network; Small-sample-size problem; ALGORITHM;
DOI
10.1016/j.ins.2024.121317
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Subject classification code
0812 ;
Abstract
Training convolutional neural networks (CNNs) using back-propagation (BP) is a time-consuming and resource-intensive process, primarily due to the need to iterate over the dataset multiple times. In contrast, analytic learning aims to train neural networks in a single epoch, offering a potential solution to these challenges. However, existing studies of analytic learning have been limited to multilayer perceptrons (MLPs). In this article, we propose an analytic formulation for convolutional neural network learning (ACnnL), which represents a significant advancement towards non-iterative learning paradigms for CNNs. Our formulation demonstrates that ACnnL extends the principles of MLP regularization constraints. From the implicit regularization and network interpretability viewpoints, we provide insights into why CNNs often exhibit superior generalization capabilities. The ACnnL is validated by conducting classification tasks on benchmark datasets such as MNIST, FashionMNIST, CIFAR10, CIFAR100 and Tiny-ImageNet. Encouragingly, the ACnnL trains CNNs significantly faster, with prediction accuracies reasonably close to those obtained using BP. In particular, a 5-layer vanilla CNN trained by ACnnL achieved accuracies of 0.9931, 0.9155, 0.7049 and 0.4628 for these datasets. The ACnnL achieves training speeds that are approximately 17 times faster than BP on GPU and 113 times faster than BP on CPU, while maintaining competitive prediction accuracies. Moreover, our experiments reveal a unique advantage of ACnnL in the small-sample scenario, when training data are scarce or expensive. In a nutshell, an analytic method that deals well with small-sample-size data has been put forward for the first time for fast CNN training with inherent network interpretability.
Pages: 17
Related papers
50 in total
  • [21] Relative ordering learning in spiking neural network for pattern recognition
    Lin, Zhitao
    Ma, De
    Meng, Jianyi
    Chen, Linna
    NEUROCOMPUTING, 2018, 275 : 94 - 106
  • [22] Verification Code Recognition Based On Active Learning And Convolutional Neural Network
    Chen, Xingqi
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 443 - 447
  • [23] Recognition of learning-centered emotions using a convolutional neural network
    Gonzalez-Hernandez, Francisco
    Zatarain-Cabada, Ramon
    Barron-Estrada, Maria Lucia
    Rodriguez-Rangel, Hector
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3325 - 3336
  • [24] Leukocyte recognition with convolutional neural network
    Lin, Liqun
    Wang, Weixing
    Chen, Bolin
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 13 : 1 - 8
  • [25] A Novel Event-Driven Spiking Convolutional Neural Network for Electromyography Pattern Recognition
    Xu, Mengjuan
    Chen, Xiang
    Sun, Antong
    Zhang, Xu
    Chen, Xun
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2023, 70 (09) : 2604 - 2615
  • [26] Pattern Recognition of Partial Discharges in DC XLPE Cables Based on Convolutional Neural Network
    Zhu Y.
    Xu Y.
    Chen X.
    Sheng G.
    Jiang X.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2020, 35 (03) : 659 - 668
  • [27] Convolutional Neural Network Models for Scattering Pattern Recognition of Scanning Electron Microscopy Images
    Phankokkruad, Manop
    Wacharawichanant, Sirirat
    2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 27 - 31
  • [28] Artificial Auditory Perception Pattern Recognition System Based on Spatiotemporal Convolutional Neural Network
    Fang, Xia
    Fang, Han
    Feng, Zhan
    Wang, Jie
    Zhou, Libin
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [29] A Deep convolutional neural network with residual blocks for wafer map defect pattern recognition
    Amogne, Zemenu Endalamaw
    Wang, Fu-Kwun
    Chou, Jia-Hong
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2022, 38 (01) : 343 - 357
  • [30] Network Protocol Recognition Based on Convolutional Neural Network
    Wenbo Feng
    Zheng Hong
    Lifa Wu
    Menglin Fu
    Yihao Li
    Peihong Lin
    China Communications, 2020, 17 (04) : 125 - 139