Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

被引:0
|
作者
Haiyuan Chen
Lianglun Cheng
Guoheng Huang
Ganghan Zhang
Jiaying Lan
Zhiwen Yu
Chi-Man Pun
Wing-Kuen Ling
机构
[1] Guangdong University of Technology,School of Computer Science and Technology
[2] South China University of Technology,School of Computer Science and Engineering
[3] University of Macau,Department of Computer and Information Science
[4] Guangdong University of Technology,School of Information Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
Although the existing Fine-Grained Visual Classification (FGVC) researches has made some progress, there are still some deficiencies need to be refined. Specifically, 1. The feature maps are used directly by most methods after they are extracted from the original images, which lacks further processing of feature maps and may lead irrelevant features to negatively affect network performance; 2. In many methods, the utilize of feature maps is relatively simple, and the relationship between feature maps that helpful for accurate classification is ignored. 3. Due to the high similarity between subcategories as well as the randomness and instability of training, the network prediction results may sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first one is the Self-supervised Attention Map Filter, which is proposed to extract the initial attention maps of subcategories and filter out the most distinguishable and representative local attention maps. The second module is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with the filtered attention maps. The third module is the Reiterative Prediction, in which the first prediction result of the network is re-utilized by this module to improve the accuracy and stability. Experimental results show that our SA-MFN outperforms the state-of-the-art methods on multiple fine-grained classification datasets, especially on the dataset of Stanford Cars, the proposed network achieves the accuracy of 94.7%.
引用
收藏
页码:15673 / 15689
页数:16
相关论文
共 50 条
  • [31] Attention-based supervised contrastive learning on fine-grained image classification
    Li, Qian
    Wu, Weining
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
  • [32] Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification
    Wang, Jiahui
    Xu, Qin
    Jiang, Bo
    Luo, Bin
    Tang, Jinhui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4529 - 4542
  • [33] Based on the multi-scale information sharing network of fine-grained attention for agricultural pest detection
    Wang Linfeng
    Liu Yong
    Liu Jiayao
    Wang Yunsheng
    Xu Shipu
    PLOS ONE, 2023, 18 (10):
  • [34] Fine-Grained Image Classification for Crop Disease Based on Attention Mechanism
    Yang, Guofeng
    He, Yong
    Yang, Yong
    Xu, Beibei
    FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [35] WEB-SUPERVISED NETWORK FOR FINE-GRAINED VISUAL CLASSIFICATION
    Zhang, Chuanyi
    Ya, Yazhou
    Zhang, Jiachao
    Chen, Jiaxin
    Huang, Pu
    Zhang, Jian
    Tang, Zhenmin
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [36] Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems
    Shu, Yangyang
    van den Hengel, Anton
    Liu, Lingqiao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11392 - 11401
  • [37] A FREE LUNCH FROM VIT: ADAPTIVE ATTENTION MULTI-SCALE FUSION TRANSFORMER FOR FINE-GRAINED VISUAL RECOGNITION
    Zhang, Yuan
    Cao, Jian
    Zhang, Ling
    Liu, Xiangcheng
    Wang, Zhiyi
    Ling, Feng
    Chen, Weiqian
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3234 - 3238
  • [38] Fine-Grained Image Classification Combining Swin and Multi-Scale Feature Fusion
    Xiang, Jianwen
    Chen, Minrong
    Yang, Baibing
    Computer Engineering and Applications, 2023, 59 (20): : 147 - 157
  • [39] Multi-scale attention-based adaptive feature fusion network for fine-grained ship classification in remote sensing scenarios
    Liu, Kun
    Zhang, Xiaomeng
    Xu, Zhijing
    Liu, Sidong
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (03)
  • [40] Multi-scale attention-based adaptive feature fusion network for fine-grained ship classification in remote sensing scenarios
    Liu, Kun
    Zhang, Xiaomeng
    Xu, Zhijing
    Liu, Sidong
    Journal of Applied Remote Sensing, 1600, 18 (03):