PathNet: a novel multi-pathway convolutional neural network for few-shot image classification from scratch

被引:0
|
作者
Fan, Zhonghua [1 ]
Sun, Dongbai [1 ,2 ,4 ]
Yu, Hongying [3 ,4 ]
Zhang, Weidong [1 ]
机构
[1] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing 100083, Peoples R China
[2] Sun Yat Sen Univ, Sch Mat Sci & Engn, Guangzhou 510275, Peoples R China
[3] Sun Yat Sen Univ, Sch Mat, Guangzhou 510275, Peoples R China
[4] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Innovat Grp Marine Engn Mat & Corros Control, Zhuhai 519080, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Computer vision; Multi-pathway; Image classification; Global attention;
D O I
10.1007/s00530-024-01330-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, advanced computer vision models have trended toward deeper and larger network architectures, and model depth is often considered an important feature for achieving superior performance. While deeper networks can help solve complex vision tasks, they also raise issues such as model space complexity and parallelization of long tandem block structures. Therefore, we revisit the network design space by using a shallower depth to reduce the complexity of the vertical spatial structure, horizontally extending multiple computational pathways to improve model capacity and scalability, and highly optimizing the internal modeling to fully exploit the inherent inductive biases in ConvNet and the intrinsic benefits of global attention to improve the overall performance. In this paper, we propose a novel 16-layer shallow depth multi-pathway parallel convolutional neural network, called PathNet, which can be used as a generic backbone for few-shot image classification. We evaluate the effectiveness of PathNet by training from scratch. Experimental results show that PathNet achieves a top-1 accuracy of 54.06% on the Oxford Flowers-102 dataset, 95.16% on the Cifar10 dataset, 76.02% on the Cifar100 dataset, and 61.85% on the TinyImageNet dataset, providing a competitive advantage and great potential in terms of accuracy, scalability, and parallelization compared to the state-of-the-art models.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Total Relation Network with Attention for Few-Shot Image Classification
    Li X.-X.
    Liu Z.-Y.
    Wu J.-J.
    Cao J.
    Ma Z.-Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (02): : 371 - 384
  • [22] FEW-SHOT IMAGE CLASSIFICATION WITH MULTI-FACET PROTOTYPES
    Yan, Kun
    Bouraoui, Zied
    Wang, Ping
    Jameel, Shoaib
    Schockaert, Steven
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1740 - 1744
  • [23] Deep Relation Network for Hyperspectral Image Few-Shot Classification
    Gao, Kuiliang
    Liu, Bing
    Yu, Xuchu
    Qin, Jinchun
    Zhang, Pengqiang
    Tan, Xiong
    REMOTE SENSING, 2020, 12 (06)
  • [24] PANet: Pluralistic Attention Network for Few-Shot Image Classification
    Cao, Wenming
    Li, Tianyuan
    Liu, Qifan
    He, Zhiquan
    NEURAL PROCESSING LETTERS, 2024, 56 (04)
  • [25] Local Mutual Metric Network for Few-Shot Image Classification
    Li, Yaohui
    Li, Huaxiong
    Chen, Haoxing
    Chen, Chunlin
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 443 - 454
  • [26] Multi-task classification network for few-shot learning
    Ji, Zhong
    Liu, Yuanheng
    Wang, Xuan
    Liu, Jingren
    Cao, Jiale
    Yu, Yunlong
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2025, 14 (01)
  • [27] Integrating deformable CNN and attention mechanism into multi-scale graph neural network for few-shot image classification
    Liu, Yongmin
    Xiao, Fengjiao
    Zheng, Xinying
    Deng, Weihao
    Ma, Haizhi
    Su, Xinyao
    Wu, Lei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [28] A Convolutional Neural Network Method for Boundary Optimization Enables Few-Shot Learning for Biomedical Image Segmentation
    Rutter, Erica M.
    Lagergren, John H.
    Flores, Kevin B.
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER AND MEDICAL IMAGE LEARNING WITH LESS LABELS AND IMPERFECT DATA, DART 2019, MIL3ID 2019, 2019, 11795 : 190 - 198
  • [29] Image Recognition of Mine Water Inrush Based on Bilinear Convolutional Neural Network with Few-Shot Learning
    Zhang, Shuai
    Du, Yuanze
    Zhao, Yingwang
    Zhou, Lifu
    ACS OMEGA, 2024, 9 (10): : 12027 - 12036
  • [30] Few-shot Image Classification Algorithm Based on Multi-scale Attention and Residual Network
    Wang, Qi
    Jin, Huazhong
    Yan, Meng
    Li, Lin
    2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 641 - 645