A survey of fine-grained visual categorization based on deep learning

被引:0
|
作者
Xie Yuxiang [1 ]
Gong Quanzhi [1 ]
Luan Xidao [2 ]
Yan Jie [1 ]
Zhang Jiahui [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410000, Peoples R China
[2] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410003, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; fine-grained visual categorization; convolutional neural network (CNN); visual attention; ATTENTION; NETWORK;
D O I
10.23919/JSEE.2022.000155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning has achieved excellent results in various tasks in the field of computer vision, especially in fine-grained visual categorization. It aims to distinguish the subordinate categories of the label-level categories. Due to high intra-class variances and high inter-class similarity, the fine-grained visual categorization is extremely challenging. This paper first briefly introduces and analyzes the related public datasets. After that, some of the latest methods are reviewed. Based on the feature types, the feature processing methods, and the overall structure used in the model, we divide them into three types of methods: methods based on general convolutional neural network (CNN) and strong supervision of parts, methods based on single feature processing, and methods based on multiple feature processing. Most methods of the first type have a relatively simple structure, which is the result of the initial research. The methods of the other two types include models that have special structures and training processes, which are helpful to obtain discriminative features. We conduct a specific analysis on several methods with high accuracy on public datasets. In addition, we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power. In terms of technology, the extraction of the subtle feature information with the burgeoning vision transformer (ViT) network is also an important research direction.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Learning sequentially diversified representations for fine-grained categorization
    Zhang, Lianbo
    Huang, Shaoli
    Liu, Wei
    PATTERN RECOGNITION, 2022, 121
  • [32] Fine-Grained Categorization by Alignments
    Gavves, E.
    Fernando, B.
    Snoek, C. G. M.
    Smeulders, A. W. M.
    Tuytelaars, T.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1713 - 1720
  • [33] Coping with change: Learning invariant and minimum sufficient representations for fine-grained visual categorization
    Ye, Shuo
    Yu, Shujian
    Hou, Wenjin
    Wang, Yu
    You, Xinge
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 237
  • [34] A Survey on Deep Learning-based Fine-grained Object Classification and Semantic Segmentation
    Bo Zhao
    Jiashi Feng
    Xiao Wu
    Shuicheng Yan
    International Journal of Automation and Computing, 2017, 14 (02) : 119 - 135
  • [35] Fine-grained Visual Categorization with 2D-Warping
    Hanselmann, Harald
    Ney, Hermann
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 608 - 613
  • [36] Recombining Vision Transformer Architecture for Fine-Grained Visual Categorization
    Deng, Xuran
    Liu, Chuanbin
    Lu, Zhiying
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 127 - 138
  • [37] A survey on deep learning-based fine-grained object classification and semantic segmentation
    Zhao B.
    Feng J.
    Wu X.
    Yan S.
    International Journal of Automation and Computing, 2017, 14 (2) : 119 - 135
  • [38] Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization
    Xu, Kunran
    Lai, Rui
    Gu, Lin
    Li, Yishi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3488 - 3500
  • [39] SHAPE-GUIDED SEGMENTATION FOR FINE-GRAINED VISUAL CATEGORIZATION
    Sun, Ming
    Yang, Jufeng
    Sun, Bo
    Wang, Kai
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [40] Refined probability distribution module for fine-grained visual categorization
    Zhao, Peipei
    Miao, Qiguang
    Li, Hongsheng
    Liu, Ruyi
    Quan, Yining
    Song, Jianfeng
    NEUROCOMPUTING, 2023, 518 : 533 - 544