Fine-Grained Recognition With Learnable Semantic Data Augmentation

被引:8
|
作者
Pu, Yifan [1 ]
Han, Yizeng [1 ]
Wang, Yulin [1 ]
Feng, Junlan [2 ]
Deng, Chao [2 ]
Huang, Gao [1 ]
机构
[1] Tsinghua Univ, Dept Automat, BNRist, Beijing 100084, Peoples R China
[2] China Mobile Res Inst, Beijing 100053, Peoples R China
关键词
Fine-grained recognition; data augmentation; meta-learning; deep learning; CLASSIFICATION; IMAGE;
D O I
10.1109/TIP.2024.3364500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories within the same meta-category. Since images belonging to the same meta-category usually share similar visual appearances, mining discriminative visual cues is the key to distinguishing fine-grained categories. Although commonly used image-level data augmentation techniques have achieved great success in generic image classification problems, they are rarely applied in fine-grained scenarios, because their random editing-region behavior is prone to destroy the discriminative visual cues residing in the subtle regions. In this paper, we propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem. Specifically, we produce diversified augmented samples by translating image features along semantically meaningful directions. The semantic directions are estimated with a covariance prediction network, which predicts a sample-wise covariance matrix to adapt to the large intra-class variation inherent in fine-grained images. Furthermore, the covariance prediction network is jointly optimized with the classification network in a meta-learning manner to alleviate the degenerate solution problem. Experiments on four competitive fine-grained recognition benchmarks (CUB-200-2011, Stanford Cars, FGVC Aircrafts, NABirds) demonstrate that our method significantly improves the generalization performance on several popular classification networks (e.g., ResNets, DenseNets, EfficientNets, RegNets and ViT). Combined with a recently proposed method, our semantic data augmentation approach achieves state-of-the-art performance on the CUB-200-2011 dataset. Source code is available at https://github.com/LeapLabTHU/LearnableISDA.
引用
收藏
页码:3130 / 3144
页数:15
相关论文
共 50 条
  • [31] FINE-GRAINED AND LAYERED OBJECT RECOGNITION
    Wu, Yang
    Zheng, Nanning
    Liu, Yuanliu
    Yuan, Zejian
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (02)
  • [32] SELECTIVE PARTS FOR FINE-GRAINED RECOGNITION
    Li, Dong
    Li, Yali
    Wang, Shengjin
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 922 - 926
  • [33] Deep LSAC for Fine-Grained Recognition
    Lin, Di
    Wang, Yi
    Liang, Lingyu
    Li, Ping
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 200 - 214
  • [34] FgER: Fine-Grained Entity Recognition
    Abhishek
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8008 - 8009
  • [35] A dataset for fine-grained seed recognition
    Yuan, Min
    Lv, Ningning
    Dong, Yongkang
    Hu, Xiaowen
    Lu, Fuxiang
    Zhan, Kun
    Shen, Jiacheng
    Wu, Xiaolin
    Zhu, Liye
    Xie, Yufei
    SCIENTIFIC DATA, 2024, 11 (01)
  • [36] Ultra Fine-Grained Image Semantic Embedding
    Juan, Da-Cheng
    Lu, Chun-To
    Li, Zhen
    Peng, Futang
    Timofeev, Aleksei
    Chen, Yi-Ting
    Gao, Yaxi
    Duerig, Tom
    Tomkins, Andrew
    Ravi, Sujith
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 277 - 285
  • [37] Fine-grained Semantic Textual Similarity for Serbian
    Batanovic, Vuk
    Cvetanovic, Milos
    Nikolic, Bosko
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1370 - 1378
  • [38] Semantic-Aware Fine-Grained Correspondence
    Hu, Yingdong
    Wang, Renhao
    Zhang, Kaifeng
    Gao, Yang
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 97 - 115
  • [39] Dual attention interactive fine-grained classification network based on data augmentation
    Zhu, Qiangxi
    Kuang, Wenlan
    Li, Zhixin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [40] Class Activation Mapping Guided Data Augmentation for Fine-Grained Visual Classification
    Guo W.
    Wang T.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (11): : 1698 - 1704