Fine-grained image classification based on TinyVit object location and graph convolution network

被引:0
|
作者
Zheng, Shijie [1 ]
Wang, Gaocai [1 ]
Yuan, Yujian [1 ]
Huang, Shuqiang [2 ]
机构
[1] Guangxi Univ, Sch Comp & Elect & Informat, Nanning 530004, Peoples R China
[2] Jinan Univ, Coll Cyber Secur, Guangzhou 510632, Peoples R China
基金
中国国家自然科学基金;
关键词
Fine-grained image classification; TinyVit; Object location; Spatial relationship feature learning; Graph convolution network;
D O I
10.1016/j.jvcir.2024.104120
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fine-grained image classification is a branch of image classification. Recently, vision transformer has made excellent progress in the field of image recognition. Its self -attention mechanism can extract very effective image feature information. However, feeding fixed -size image blocks into the network introduces additional noise, which is detrimental to extract discriminative features for fine-grained images. The vision transformer's network model is large, making it difficult to utilize in practice. Moreover, many of today's fine-grained image classification methods focus on mining discriminative features while ignoring the connections within the image. To address these problems, we propose a novel method based on the lightweight TinyVit backbone network. Our approach utilizes the self -attention weight values of TinyVit as a guide to construct an effective object location (OL) module that cuts and enlarges the object area, providing the network with the opportunity to concentrate on the local object. Additionally, we employ the graph convolutional network (GCN) to create a spatial relationship feature learning (SRFL) module that captures spatial context information between image blocks in TinyVit with the help of the transformer's self -attention weights. OL and SRFL collaborate to jointly guide the classification task. The experimental results show that the proposed method achieved competitive performance, with the second -highest classification faccuracy on both the CUB -200-2011 and NABirds datasets. When tested on the Stanford Dogs dataset, our approach outperformed many popular methods. Our code is uploaded on https://gith ub.com/hhhj1999/SRFL_OL.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Graph-Propagation Based Correlation Learning for Weakly Supervised Fine-Grained Image Classification
    Wang, Zhihui
    Wang, Shijie
    Li, Haojie
    Dou, Zhi
    Li, Jianjun
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12289 - 12296
  • [22] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
    Li, Wenhao
    Zhu, Hongqing
    Yang, Suyi
    Wang, Pengyu
    Zhang, Han
    Neural Computing and Applications, 2022, 34 (23) : 21387 - 21401
  • [23] A Fine-Grained Indoor Location-Based Social Network
    Elhamshary, Moustafa
    Basalamah, Anas
    Youssef, Moustafa
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2017, 16 (05) : 1203 - 1217
  • [24] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
    Li, Wenhao
    Zhu, Hongqing
    Yang, Suyi
    Wang, Pengyu
    Zhang, Han
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (23): : 21387 - 21401
  • [25] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
    Wenhao Li
    Hongqing Zhu
    Suyi Yang
    Pengyu Wang
    Han Zhang
    Neural Computing and Applications, 2022, 34 : 21387 - 21401
  • [26] Fine-Grained Classification of Optical Remote Sensing Ship Images Based on Deep Convolution Neural Network
    Chen, Yantong
    Zhang, Zhongling
    Chen, Zekun
    Zhang, Yanyan
    Wang, Junsheng
    REMOTE SENSING, 2022, 14 (18)
  • [27] Efficient multi-granularity network for fine-grained image classification
    Jiabao Wang
    Yang Li
    Hang Li
    Xun Zhao
    Rui Zhang
    Zhuang Miao
    Journal of Real-Time Image Processing, 2022, 19 : 853 - 866
  • [28] Efficient multi-granularity network for fine-grained image classification
    Wang, Jiabao
    Li, Yang
    Li, Hang
    Zhao, Xun
    Zhang, Rui
    Miao, Zhuang
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (05) : 853 - 866
  • [29] Pixel Saliency Based Encoding for Fine-Grained Image Classification
    Yin, Chao
    Zhang, Lei
    Liu, Ji
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 274 - 285
  • [30] Hybrid ViT-CNN Network for Fine-Grained Image Classification
    Shao, Ran
    Bi, Xiao-Jun
    Chen, Zheng
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1109 - 1113