Coping with change: Learning invariant and minimum sufficient representations for fine-grained visual categorization

被引:3
|
作者
Ye, Shuo [1 ]
Yu, Shujian [2 ,3 ]
Hou, Wenjin [1 ]
Wang, Yu [1 ]
You, Xinge [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Huibei, Peoples R China
[2] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
[3] UiT The Arctic Univ Norway, Machine Learning Grp, Tromso, Norway
基金
国家重点研发计划;
关键词
Fine-grained visual categorization; Invariant risk minimization; Information bottleneck; ENTROPY;
D O I
10.1016/j.cviu.2023.103837
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization (FGVC) is a challenging task due to similar visual appearances between various species. Previous studies always implicitly assume that the training and test data have the same underlying distributions, and that features extracted by modern backbone architectures remain discriminative and generalize well to unseen test data. However, we empirically justify that these conditions are not always true on benchmark datasets. To this end, we combine the merits of invariant risk minimization (IRM) and information bottleneck (IB) principle to learn invariant and minimum sufficient (IMS) representations for FGVC, such that the overall model can always discover the most succinct and consistent fine-grained features. We apply the matrix-based Renyi's..-order entropy to simplify and stabilize the training of IB; we also design a ''soft" environment partition scheme to make IRM applicable to FGVC task. To the best of our knowledge, we are the first to address the problem of FGVC from a generalization perspective and develop a new informationtheoretic solution accordingly. Extensive experiments demonstrate the consistent performance gain offered by our IMS. Code is available at: https://github.com/SYe- hub/IMS.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Fine-Grained Visual Categorization of Fasteners in Overhaul Processes
    Taheritanjani, Sajjad
    Haladjian, Juan
    Bruegge, Bernd
    CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, : 241 - 248
  • [22] ProtoSimi: label correction for fine-grained visual categorization
    Jialiang Shen
    Yu Yao
    Shaoli Huang
    Zhiyong Wang
    Jing Zhang
    Ruxing Wang
    Jun Yu
    Tongliang Liu
    Machine Learning, 2024, 113 : 1903 - 1920
  • [23] Fine-Grained Visual Categorization via Multi-stage Metric Learning
    Qian, Qi
    Jin, Rong
    Zhu, Shenghuo
    Lin, Yuanqing
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3716 - 3724
  • [24] 3D Object Representations for Fine-Grained Categorization
    Krause, Jonathan
    Stark, Michael
    Deng, Jia
    Li Fei-Fei
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 554 - 561
  • [25] Fine-Grained Categorization by Alignments
    Gavves, E.
    Fernando, B.
    Snoek, C. G. M.
    Smeulders, A. W. M.
    Tuytelaars, T.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1713 - 1720
  • [26] Recombining Vision Transformer Architecture for Fine-Grained Visual Categorization
    Deng, Xuran
    Liu, Chuanbin
    Lu, Zhiying
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 127 - 138
  • [27] Fine-grained Visual Categorization with 2D-Warping
    Hanselmann, Harald
    Ney, Hermann
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 608 - 613
  • [28] Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization
    Xu, Kunran
    Lai, Rui
    Gu, Lin
    Li, Yishi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3488 - 3500
  • [29] SHAPE-GUIDED SEGMENTATION FOR FINE-GRAINED VISUAL CATEGORIZATION
    Sun, Ming
    Yang, Jufeng
    Sun, Bo
    Wang, Kai
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [30] Refined probability distribution module for fine-grained visual categorization
    Zhao, Peipei
    Miao, Qiguang
    Li, Hongsheng
    Liu, Ruyi
    Quan, Yining
    Song, Jianfeng
    NEUROCOMPUTING, 2023, 518 : 533 - 544