Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

Cited by: 0
Authors
Haiyuan Chen
Lianglun Cheng
Guoheng Huang
Ganghan Zhang
Jiaying Lan
Zhiwen Yu
Chi-Man Pun
Wing-Kuen Ling
Affiliations
[1] Guangdong University of Technology, School of Computer Science and Technology
[2] South China University of Technology, School of Computer Science and Engineering
[3] University of Macau, Department of Computer and Information Science
[4] Guangdong University of Technology, School of Information Engineering
Source
Applied Intelligence | 2022 / Vol. 52
Keywords
Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning
DOI
Not available
Abstract
Although existing research on Fine-Grained Visual Classification (FGVC) has made progress, several deficiencies remain. First, most methods use the feature maps directly after extracting them from the original images; without further processing, irrelevant features may degrade network performance. Second, many methods use feature maps in a relatively simple way and ignore the relationships between feature maps that help accurate classification. Third, because subcategories are highly similar and training is random and unstable, the network's predictions are sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first is the Self-supervised Attention Map Filter, which extracts the initial attention maps of subcategories and selects the most distinguishable and representative local attention maps. The second is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with them. The third is the Reiterative Prediction module, which reuses the network's first prediction result to improve accuracy and stability. Experimental results show that SA-MFN outperforms state-of-the-art methods on multiple fine-grained classification datasets; on Stanford Cars in particular, the proposed network achieves an accuracy of 94.7%.
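The data flow through the first two modules can be illustrated with a toy NumPy sketch. It is not the authors' implementation: the abstract does not say how attention maps are scored or how the global spatial map is computed, so the peak-response score and the mean over maps used here are assumptions, and the function names `filter_attention_maps` and `add_global_map` are hypothetical. The Reiterative Prediction module is not sketched.

```python
import numpy as np


def filter_attention_maps(attention_maps, k):
    """Keep the k maps with the highest peak response (assumed scoring rule)."""
    # attention_maps has shape (M, H, W): M candidate attention maps.
    scores = attention_maps.reshape(attention_maps.shape[0], -1).max(axis=1)
    keep = np.sort(np.argsort(scores)[-k:])  # top-k indices, in original order
    return attention_maps[keep], keep


def add_global_map(filtered_maps):
    """Append one global spatial map (assumed: mean over maps) as an extra channel."""
    global_map = filtered_maps.mean(axis=0, keepdims=True)      # shape (1, H, W)
    return np.concatenate([filtered_maps, global_map], axis=0)  # shape (k + 1, H, W)


rng = np.random.default_rng(0)
maps = rng.random((8, 4, 4))  # 8 toy attention maps of size 4x4
filtered, kept = filter_attention_maps(maps, k=3)
multi_scale = add_global_map(filtered)
print(filtered.shape, multi_scale.shape)
```

In a real network the selection would be learned rather than fixed, and the maps would come from a convolutional backbone instead of random data.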
Pages: 15673 - 15689
Page count: 16
Related papers
50 in total
  • [41] Self-supervised 3D face reconstruction based on multi-scale feature fusion and dual attention mechanism
    Zhou D.-K.
    Zhang C.
    Yang X.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (10): 2428 - 2437
  • [42] MFANet: A Collar Classification Network Based on Multi-Scale Features and an Attention Mechanism
    Qin, Xiao
    Ya, Shanshan
    Yuan, Changan
    Chen, Dingjia
    Long, Long
    Liao, Huixian
    MATHEMATICS, 2023, 11 (05)
  • [43] Self-supervised facial expression recognition with fine-grained feature selection
    An, Heng-Yu
    Jia, Rui-Sheng
    VISUAL COMPUTER, 2024, 40 (10): 7001 - 7013
  • [44] A Progressive Gated Attention Model for Fine-Grained Visual Classification
    Zhu, Qiangxi
    Li, Zhixin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023: 2063 - 2068
  • [45] Learning Hierarchal Channel Attention for Fine-grained Visual Classification
    Guan, Xiang
    Wang, Guoqing
    Xu, Xing
    Bin, Yi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 5011 - 5019
  • [46] A collaborative gated attention network for fine-grained visual classification
    Zhu, Qiangxi
    Kuang, Wenlan
    Li, Zhixin
    DISPLAYS, 2023, 79
  • [47] Hierarchical attention vision transformer for fine-grained visual classification
    Hu, Xiaobin
    Zhu, Shining
    Peng, Taile
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 91
  • [48] Fine-Grained Visual Classification Network Based on Fusion Pooling and Attention Enhancement
    Xiao B.
    Guo J.
    Zhang X.
    Wang M.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (07): 661 - 670
  • [49] Attention to fine-grained information: hierarchical multi-scale network for retinal vessel segmentation
    Chengzhi Lyu
    Guoqing Hu
    Dan Wang
    The Visual Computer, 2022, 38 : 345 - 355
  • [50] Multi-Scale CNN for Fine-Grained Image Recognition
    Won, Chee Sun
    IEEE ACCESS, 2020, 8 : 116663 - 116674