An analytic formulation of convolutional neural network learning for pattern recognition

被引:2
|
作者
Zhuang, Huiping [1 ]
Lin, Zhiping [2 ]
Yang, Yimin [3 ]
Toh, Kar-Ann [4 ]
机构
[1] South China Univ Technol, Shien Ming Wu Sch Intelligent Engn, Guangzhou 510460, Peoples R China
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[3] Western Univ, Dept Elect & Comp Engn, London, ON, Canada
[4] Yonsei Univ, Sch Elect & Elect Engn, Seoul 03722, South Korea
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Pattern classification; Neural network learning; Analytic learning; Convolutional neural network; Small-sample-size problem; ALGORITHM;
D O I
10.1016/j.ins.2024.121317
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Training convolutional neural networks (CNNs) using back-propagation (BP) is a time-consuming and resource-intensive process, primarily due to the need to iterate over the dataset multiple times. In contrast, analytic learning aims to train neural networks in a single epoch, offering a potential solution to these challenges. However, existing studies of analytic learning have been limited to multilayer perceptrons (MLPs). In this article, we propose an analytic formulation for convolutional neural network learning (ACnnL), which represents a significant advancement towards non-iterative learning paradigms for CNNs. Our formulation demonstrates that ACnnL extends the principles of MLP regularization constraints. From the implicit regularization and network interpretability viewpoints, we provide insights into why CNNs often exhibit superior generalization capabilities. The ACnnL is validated by conducting classification tasks on benchmark datasets such as MNIST, FashionMNIST, CIFAR10, CIFAR100 and Tiny-ImageNet. It is encouraging that the ACnnL trains CNNs in a significantly fast manner with reasonably close prediction accuracies to those using BP. In particular, a 5-layer vanilla CNN trained by ACnnL gave an accuracy of 0.9931, 0.9155, 0.7049 and 0.4628 for these datasets. The ACnnL achieves training speeds that are approximately 17 times faster than BP on GPU and 113 times faster than BP on CPU, while maintaining competitive prediction accuracies. Moreover, our experiments disclose a unique advantage of ACnnL under the small-sample scenario when training data are scarce or expensive. In a nutshell, an analytic method which deals well with small-sample size data has been put forward for the first time for fast CNN training with inherent network interpretability.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] The design of cellular neural network with ratio memory for pattern learning and recognition
    Wu, CY
    Cheng, CH
    PROCEEDINGS OF THE 2000 6TH IEEE INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS (CNNA 2000), 2000, : 301 - 307
  • [42] Gait Recognition Using Convolutional Neural Network
    Sheth, Abhishek
    Sharath, Meghana
    Reddy, Sai Charan
    Sindhu, K.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (01) : 107 - 118
  • [43] Flower Recognition Based on Convolutional Neural Network
    Zhang, Xu
    Han, Ding
    Bai, Fengshan
    Ma, Ziyin
    2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019), 2019, : 333 - 338
  • [44] Convolutional neural network vectors for speaker recognition
    Hourri, Soufiane
    Nikolov, Nikola S.
    Kharroubi, Jamal
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 389 - 400
  • [45] Target Recognition Based on Convolutional Neural Network
    Wang Liqiang
    Wang Xin
    Xi Fubiao
    Dong Jian
    LIDAR IMAGING DETECTION AND TARGET RECOGNITION 2017, 2017, 10605
  • [46] Road sign recognition with Convolutional Neural Network
    Bouti, Amal
    Mahraz, Mohamed Adnane
    Riffi, Jamal
    Tairi, Hamid
    2018 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV2018), 2018,
  • [47] Convolutional Neural Network Architecture for Semaphore Recognition
    Li, Wanchong
    Yang, Yuliang
    Wang, Mengyuan
    Zhang, Linhao
    Zhu, Mengyu
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 559 - 562
  • [48] Emotion Recognition Using a Convolutional Neural Network
    Zatarain-Cabada, Ramon
    Lucia Barron-Estrada, Maria
    Gonzalez-Hernandez, Francisco
    Rodriguez-Rangel, Hector
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2017, PT II, 2018, 10633 : 208 - 219
  • [49] Dynamic Convolutional Neural Network for Activity Recognition
    You, Chih-Hsiang
    Chiang, Chen-Kuo
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [50] Shallow Convolutional Neural Networks for Pattern Recognition Problems
    Gorokhovatskyi, Oleksii
    Peredrii, Olena
    2018 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA STREAM MINING & PROCESSING (DSMP), 2018, : 459 - 463