Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

被引:0
|
作者
Haiyuan Chen
Lianglun Cheng
Guoheng Huang
Ganghan Zhang
Jiaying Lan
Zhiwen Yu
Chi-Man Pun
Wing-Kuen Ling
机构
[1] Guangdong University of Technology,School of Computer Science and Technology
[2] South China University of Technology,School of Computer Science and Engineering
[3] University of Macau,Department of Computer and Information Science
[4] Guangdong University of Technology,School of Information Engineering
来源
Applied Intelligence | 2022年 / 52卷
关键词
Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
Although the existing Fine-Grained Visual Classification (FGVC) researches has made some progress, there are still some deficiencies need to be refined. Specifically, 1. The feature maps are used directly by most methods after they are extracted from the original images, which lacks further processing of feature maps and may lead irrelevant features to negatively affect network performance; 2. In many methods, the utilize of feature maps is relatively simple, and the relationship between feature maps that helpful for accurate classification is ignored. 3. Due to the high similarity between subcategories as well as the randomness and instability of training, the network prediction results may sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first one is the Self-supervised Attention Map Filter, which is proposed to extract the initial attention maps of subcategories and filter out the most distinguishable and representative local attention maps. The second module is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with the filtered attention maps. The third module is the Reiterative Prediction, in which the first prediction result of the network is re-utilized by this module to improve the accuracy and stability. Experimental results show that our SA-MFN outperforms the state-of-the-art methods on multiple fine-grained classification datasets, especially on the dataset of Stanford Cars, the proposed network achieves the accuracy of 94.7%.
引用
收藏
页码:15673 / 15689
页数:16
相关论文
共 50 条
  • [21] Fine-Grained Detection Model Based on Attention Mechanism and Multi-Scale Feature Fusion for Cocoon Sorting
    Zheng, Han
    Guo, Xueqiang
    Ma, Yuejia
    Zeng, Xiaoxi
    Chen, Jun
    Zhang, Taohong
    AGRICULTURE-BASEL, 2024, 14 (05):
  • [22] Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification
    Park W.
    Ryu J.
    Comput. Biol. Med., 2024,
  • [23] Ship fine-grained classification network based on multi-scale feature fusion
    Chen, Lisu
    Wang, Qian
    Zhu, Enyan
    Feng, Daolun
    Wu, Huafeng
    Liu, Tao
    OCEAN ENGINEERING, 2025, 318
  • [24] Multi-Scale Feature Transformer Based Fine-Grained Image Classification Method
    Zhang T.
    Cai C.
    Luo X.
    Zhu Y.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (04): : 70 - 75
  • [25] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
    Dakshayani Himabindu D.
    Praveen Kumar S.
    Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in), 1600, Brno University of Technology (27): : 59 - 67
  • [26] A fine-grained image classification algorithm based on self-supervised learning and multi-feature fusion of blood cells
    Jia, Nan
    Guo, Jingxia
    Li, Yan
    Tang, Siyuan
    Xu, Li
    Liu, Liang
    Xing, Junfeng
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [27] Fine-grained Face Anti-Spoofing based on Recursive Self-Attention and Multi-Scale Fusion
    Xie, Shichuang
    Wu, Jiasheng
    Chen, Yanli
    Han, Meng
    Wu, Ting
    Qiao, Tong
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1435 - 1442
  • [28] Patch-wise self-supervised visual representation learning: a fine-grained approach
    Javidani, Ali
    Sadeghi, Mohammad Amin
    Araabi, Babak Nadjar
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (06)
  • [29] MSEC: Multi-Scale Erasure and Confusion for fine-grained image classification
    Zhang, Yan
    Sun, Yongsheng
    Wang, Nian
    Gao, Zijian
    Chen, Feng
    Wang, Chenfei
    Tang, Jun
    NEUROCOMPUTING, 2021, 449 : 1 - 14
  • [30] FEATURE COMPARISON BASED CHANNEL ATTENTION FOR FINE-GRAINED VISUAL CLASSIFICATION
    Jia, Shukun
    Bai, Yan
    Zhang, Jing
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1776 - 1780