A survey of fine-grained visual categorization based on deep learning

被引:0
|
作者
Xie Yuxiang [1 ]
Gong Quanzhi [1 ]
Luan Xidao [2 ]
Yan Jie [1 ]
Zhang Jiahui [1 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410000, Peoples R China
[2] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410003, Peoples R China
基金
中国国家自然科学基金;
关键词
deep learning; fine-grained visual categorization; convolutional neural network (CNN); visual attention; ATTENTION; NETWORK;
D O I
10.23919/JSEE.2022.000155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep learning has achieved excellent results in various tasks in the field of computer vision, especially in fine-grained visual categorization. It aims to distinguish the subordinate categories of the label-level categories. Due to high intra-class variances and high inter-class similarity, the fine-grained visual categorization is extremely challenging. This paper first briefly introduces and analyzes the related public datasets. After that, some of the latest methods are reviewed. Based on the feature types, the feature processing methods, and the overall structure used in the model, we divide them into three types of methods: methods based on general convolutional neural network (CNN) and strong supervision of parts, methods based on single feature processing, and methods based on multiple feature processing. Most methods of the first type have a relatively simple structure, which is the result of the initial research. The methods of the other two types include models that have special structures and training processes, which are helpful to obtain discriminative features. We conduct a specific analysis on several methods with high accuracy on public datasets. In addition, we support that the focus of the future research is to solve the demand of existing methods for the large amount of the data and the computing power. In terms of technology, the extraction of the subtle feature information with the burgeoning vision transformer (ViT) network is also an important research direction.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] ProtoSimi: label correction for fine-grained visual categorization
    Shen, Jialiang
    Yao, Yu
    Huang, Shaoli
    Wang, Zhiyong
    Zhang, Jing
    Wang, Ruxing
    Yu, Jun
    Liu, Tongliang
    MACHINE LEARNING, 2024, 113 (04) : 1903 - 1920
  • [22] Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization
    Ye, Shuo
    Peng, Qinmu
    Sun, Wenju
    Xu, Jiamiao
    Wang, Yu
    You, Xinge
    Cheung, Yiu-Ming
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5092 - 5102
  • [23] Hierarchical Part Matching for Fine-Grained Visual Categorization
    Xie, Lingxi
    Tian, Qi
    Hong, Richang
    Yan, Shuicheng
    Zhang, Bo
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1641 - 1648
  • [24] Learning more discriminative clues with gradual attention for fine-grained visual categorization
    Xu, Qin
    Zhang, Mengquan
    Li, Yun
    Tao, Zhifu
    IMAGE AND VISION COMPUTING, 2023, 136
  • [25] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification
    Rao, Yongming
    Chen, Guangyi
    Lu, Jiwen
    Zhou, Jie
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1005 - 1014
  • [26] Hierarchical Self-Distilled Feature Learning for Fine-Grained Visual Categorization
    Hu, Yutao
    Jiang, Xiaolong
    Liu, Xuhui
    Luo, Xiaoyan
    Hu, Yao
    Cao, Xianbin
    Zhang, Baochang
    Zhang, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, : 1 - 14
  • [27] Fine-Grained Visual Categorization of Fasteners in Overhaul Processes
    Taheritanjani, Sajjad
    Haladjian, Juan
    Bruegge, Bernd
    CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, : 241 - 248
  • [28] ProtoSimi: label correction for fine-grained visual categorization
    Jialiang Shen
    Yu Yao
    Shaoli Huang
    Zhiyong Wang
    Jing Zhang
    Ruxing Wang
    Jun Yu
    Tongliang Liu
    Machine Learning, 2024, 113 : 1903 - 1920
  • [29] Fine-Grained Visual Categorization via Multi-stage Metric Learning
    Qian, Qi
    Jin, Rong
    Zhu, Shenghuo
    Lin, Yuanqing
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3716 - 3724
  • [30] Webly-Supervised Fine-Grained Visual Categorization via Deep Domain Adaptation
    Xu, Zhe
    Huang, Shaoli
    Zhang, Ya
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (05) : 1100 - 1113