Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism

被引：0

作者：

Haiyuan Chen

Lianglun Cheng

Guoheng Huang

Ganghan Zhang

Jiaying Lan

Zhiwen Yu

Chi-Man Pun

Wing-Kuen Ling

机构：

[1] Guangdong University of Technology,School of Computer Science and Technology

[2] South China University of Technology,School of Computer Science and Engineering

[3] University of Macau,Department of Computer and Information Science

[4] Guangdong University of Technology,School of Information Engineering

来源：

Applied Intelligence | 2022年 / 52卷

关键词：

Attention mechanism; Feature filtering; Fine-grained visual classification; Self-supervised learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Although the existing Fine-Grained Visual Classification (FGVC) researches has made some progress, there are still some deficiencies need to be refined. Specifically, 1. The feature maps are used directly by most methods after they are extracted from the original images, which lacks further processing of feature maps and may lead irrelevant features to negatively affect network performance; 2. In many methods, the utilize of feature maps is relatively simple, and the relationship between feature maps that helpful for accurate classification is ignored. 3. Due to the high similarity between subcategories as well as the randomness and instability of training, the network prediction results may sometimes not accurate enough. To this end, we propose an efficient Self-supervised Attention Filtering and Multi-scale Features Network (SA-MFN) to improve the accuracy of FGVC, which consists of three modules. The first one is the Self-supervised Attention Map Filter, which is proposed to extract the initial attention maps of subcategories and filter out the most distinguishable and representative local attention maps. The second module is the Multi-scale Attention Map Generator, which extracts a global spatial feature map from the filtered attention maps and then concatenates it with the filtered attention maps. The third module is the Reiterative Prediction, in which the first prediction result of the network is re-utilized by this module to improve the accuracy and stability. Experimental results show that our SA-MFN outperforms the state-of-the-art methods on multiple fine-grained classification datasets, especially on the dataset of Stanford Cars, the proposed network achieves the accuracy of 94.7%.

引用

页码：15673 / 15689

页数：16

共 50 条

[1] Fine-grained visual classification with multi-scale features based on self-supervised attention filtering mechanism
Chen, Haiyuan
Cheng, Lianglun
Huang, Guoheng
Zhang, Ganghan
Lan, Jiaying
Yu, Zhiwen
Pun, Chi-Man
Ling, Wing-Kuen
APPLIED INTELLIGENCE, 2022, 52 (13) : 15673 - 15689
[2] Siamese self-supervised learning for fine-grained visual classification
Ji, Ruyi
Li, Jiaying
Zhang, Libo
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
[3] Multi-Scale Salient Features Bilinear Attention Fine-Grained Classification Method
Liu G.
Zhan H.
Meng Y.
Wang B.
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (11): : 1683 - 1691
[4] Multi-scale network via progressive multi-granularity attention for fine-grained visual classification
An, Chen
Wang, Xiaodong
Wei, Zhiqiang
Zhang, Ke
Huang, Lei
APPLIED SOFT COMPUTING, 2023, 146
[5] Multi-scale local regional attention fusion using visual transformers for fine-grained image classification
Li, Yusong
Xie, Bin
Li, Yuling
Zhang, Jiahao
VISUAL COMPUTER, 2024,
[6] Dual attention guided multi-scale CNN for fine-grained image classification
Liu, Xiaozhang
Zhang, Lifeng
Li, Tao
Wang, Dejian
Wang, Zhaojie
INFORMATION SCIENCES, 2021, 573 : 37 - 45
[7] Multi-scale discriminative regions attention network for fine-grained vehicle classification
Rong, Wen-Zhong
Han, Jin
Cai, Ying-Hao
Liu, Gen
Han, Jin (shnk123@163.com); Cai, Ying-Hao (yinghao.cai@ia.ac.cn), 1600, Taiwan Ubiquitous Information (06): : 164 - 177
[8] Scalenet: A Convolutional Network to Extract Multi-Scale and Fine-Grained Visual Features
Zhang, Jinpeng
Zhang, Jinming
Hu, Guyue
Chen, Yang
Yu, Shan
IEEE ACCESS, 2019, 7 : 147560 - 147570
[9] Multi-scale Sparse Network with Cross-Attention Mechanism for image-based butterflies fine-grained classification
Li, Maopeng
Zhou, Guoxiong
Cai, Weiwei
Li, Jiayong
Li, Mingxuan
He, Mingfang
Hu, Yahui
Li, Liujun
APPLIED SOFT COMPUTING, 2022, 117
[10] On Learning Discriminative Features from Synthesized Data for Self-supervised Fine-Grained Visual Recognition
Wang, Zihu
Liu, Lingqiao
Weston, Scott Ricardo Figueroa
Tian, Samuel
Li, Peng
COMPUTER VISION - ECCV 2024, PT LXXXIX, 2025, 15147 : 101 - 117

← 1 2 3 4 5 →