Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

被引：0

作者：

Haiyuan Chen

Lianglun Cheng

Guoheng Huang

Ganghan Zhang

Jiaying Lan

Zhiwen Yu

Chi-Man Pun

Wing-Kuen Ling

机构：

[1] Guangdong University of Technology,School of Computer Science and Technology

[2] South China University of Technology,School of Computer Science and Engineering

[3] University of Macau,Department of Computer and Information Science

[4] Guangdong University of Technology,School of Information Engineering

来源：

Applied Intelligence | 2022年 / 52卷

关键词：

Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Although the existing Fine-Grained Visual Classification (FGVC) researches has made some progress, there are still some deficiencies need to be refined. Specifically, 1. The feature maps are used directly by most methods after they are extracted from the original images, which lacks further processing of feature maps and may lead irrelevant features to negatively affect network performance; 2. In many methods, the utilize of feature maps is relatively simple, and the relationship between feature maps that helpful for accurate classification is ignored. 3. Due to the high similarity between subcategories as well as the randomness and instability of training, the network prediction results may sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first one is the Self-supervised Attention Map Filter, which is proposed to extract the initial attention maps of subcategories and filter out the most distinguishable and representative local attention maps. The second module is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with the filtered attention maps. The third module is the Reiterative Prediction, in which the first prediction result of the network is re-utilized by this module to improve the accuracy and stability. Experimental results show that our SA-MFN outperforms the state-of-the-art methods on multiple fine-grained classification datasets, especially on the dataset of Stanford Cars, the proposed network achieves the accuracy of 94.7%.

引用

页码：15673 / 15689

页数：16

共 50 条

[21] Fine-Grained Detection Model Based on Attention Mechanism and Multi-Scale Feature Fusion for Cocoon Sorting
Zheng, Han
Guo, Xueqiang
Ma, Yuejia
Zeng, Xiaoxi
Chen, Jun
Zhang, Taohong
AGRICULTURE-BASEL, 2024, 14 (05):
[22] Fine-Grained Self-Supervised Learning with Jigsaw puzzles for medical image classification
Park W.
Ryu J.
Comput. Biol. Med., 2024,
[23] Ship fine-grained classification network based on multi-scale feature fusion
Chen, Lisu
Wang, Qian
Zhu, Enyan
Feng, Daolun
Wu, Huafeng
Liu, Tao
OCEAN ENGINEERING, 2025, 318
[24] Multi-Scale Feature Transformer Based Fine-Grained Image Classification Method
Zhang T.
Cai C.
Luo X.
Zhu Y.
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (04): : 70 - 75
[25] A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition
Dakshayani Himabindu D.
Praveen Kumar S.
Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in), 1600, Brno University of Technology (27): : 59 - 67
[26] A fine-grained image classification algorithm based on self-supervised learning and multi-feature fusion of blood cells
Jia, Nan
Guo, Jingxia
Li, Yan
Tang, Siyuan
Xu, Li
Liu, Liang
Xing, Junfeng
SCIENTIFIC REPORTS, 2024, 14 (01):
[27] Fine-grained Face Anti-Spoofing based on Recursive Self-Attention and Multi-Scale Fusion
Xie, Shichuang
Wu, Jiasheng
Chen, Yanli
Han, Meng
Wu, Ting
Qiao, Tong
2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1435 - 1442
[28] Patch-wise self-supervised visual representation learning: a fine-grained approach
Javidani, Ali
Sadeghi, Mohammad Amin
Araabi, Babak Nadjar
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (06)
[29] MSEC: Multi-Scale Erasure and Confusion for fine-grained image classification
Zhang, Yan
Sun, Yongsheng
Wang, Nian
Gao, Zijian
Chen, Feng
Wang, Chenfei
Tang, Jun
NEUROCOMPUTING, 2021, 449 : 1 - 14
[30] FEATURE COMPARISON BASED CHANNEL ATTENTION FOR FINE-GRAINED VISUAL CLASSIFICATION
Jia, Shukun
Bai, Yan
Zhang, Jing
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1776 - 1780

← 1 2 3 4 5 →