Open Long-Tailed Recognition in a Dynamic World

Cited by: 10
Authors
Liu, Ziwei [1]
Miao, Zhongqi [2]
Zhan, Xiaohang [3]
Wang, Jiayun [2]
Gong, Boqing [4]
Yu, Stella X. [2]
Affiliations
[1] Nanyang Technol Univ, Singapore 639798, Singapore
[2] Univ Calif Berkeley, Int Comp Sci Inst, Berkeley, CA 94720 USA
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Google Inc, Mountain View, CA 94043 USA
Keywords
Tail; Visualization; Head; Training; Task analysis; Measurement; Magnetic heads; Long-tailed recognition; few-shot learning; active learning
DOI
10.1109/TPAMI.2022.3200091
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Real-world data often exhibits a long-tailed and open-ended (i.e., with unseen classes) distribution. A practical recognition system must balance between majority (head) and minority (tail) classes, generalize across the distribution, and acknowledge novelty upon the instances of unseen classes (open classes). We define Open Long-Tailed Recognition++ (OLTR++) as learning from such naturally distributed data and optimizing for the classification accuracy over a balanced test set which includes both known and open classes. OLTR++ handles imbalanced classification, few-shot learning, open-set recognition, and active learning in one integrated algorithm, whereas existing classification approaches often focus only on one or two aspects and perform poorly over the entire spectrum. The key challenges are: 1) how to share visual knowledge between head and tail classes, 2) how to reduce confusion between tail and open classes, and 3) how to actively explore open classes with learned knowledge. Our algorithm, OLTR++, maps images to a feature space such that visual concepts can relate to each other through a memory association mechanism and a learned metric (dynamic meta-embedding) that both respects the closed-world classification of seen classes and acknowledges the novelty of open classes. Additionally, we propose an active learning scheme based on visual memory, which learns to recognize open classes in a data-efficient manner for future expansions. On three large-scale open long-tailed datasets we curated from ImageNet (object-centric), Places (scene-centric), and MS1M (face-centric) data, as well as three standard benchmarks (CIFAR-10-LT, CIFAR-100-LT, and iNaturalist-18), our approach, as a unified framework, consistently demonstrates competitive performance. Notably, our approach also shows strong potential for the active exploration of open classes and the fairness analysis of minority groups.
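The open-set aspect described in the abstract (classify seen classes, flag unseen ones) can be illustrated with a generic baseline. The following is a minimal sketch and not the OLTR++ algorithm: it swaps in plain nearest-centroid classification with a distance threshold, and the names `OPEN`, `centroids`, `predict`, the toy data, and the threshold value are all hypothetical choices made for illustration.

```python
import math

OPEN = -1  # hypothetical label standing in for "open" (unseen) classes


def centroids(features, labels):
    """Average the feature vectors of each known class."""
    sums, counts = {}, {}
    for x, y in zip(features, labels):
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(x, cents, threshold):
    """Return the nearest known class, or OPEN if every centroid is too far."""
    best_label, best_dist = OPEN, math.inf
    for y, c in cents.items():
        d = math.dist(x, c)  # Euclidean distance (Python 3.8+)
        if d < best_dist:
            best_label, best_dist = y, d
    return best_label if best_dist <= threshold else OPEN


# Toy long-tailed training set: class 0 is a head class (4 samples),
# class 1 is a tail class (1 sample). The values are made up.
train_x = [[0.0, 0.0], [0.2, 0.0], [0.0, 0.2], [-0.2, 0.0], [5.0, 5.0]]
train_y = [0, 0, 0, 0, 1]
cents = centroids(train_x, train_y)

print(predict([0.1, 0.1], cents, threshold=1.0))     # near the head class
print(predict([5.1, 4.9], cents, threshold=1.0))     # near the tail class
print(predict([10.0, -10.0], cents, threshold=1.0))  # far from both
```

A fixed distance threshold is the simplest possible rejection rule; the abstract instead describes a learned metric combined with a memory association mechanism, which this sketch does not attempt to reproduce.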
Pages: 1836-1851
Page count: 16
Related papers
50 in total
  • [21] Balanced Product of Calibrated Experts for Long-Tailed Recognition
    Aimar, Emanuel Sanchez
    Jonnarth, Arvi
    Felsberg, Michael
    Kuhlmann, Marco
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19967 - 19977
  • [22] Feature fusion network for long-tailed visual recognition
    Zhou, Xuesong
    Zhai, Junhai
    Cao, Yang
    PATTERN RECOGNITION, 2023, 144
  • [23] Exploiting the Tail Data for Long-Tailed Face Recognition
    Song, Guo
    Liu, Rujie
    Wang, Mengjiao
    Meng, Zhang
    Nie, Shijie
    Lina, Septiana
    Abe, Narishige
    IEEE ACCESS, 2022, 10 : 97945 - 97953
  • [24] Attentive Feature Augmentation for Long-Tailed Visual Recognition
    Wang, Weiqiu
    Zhao, Zhicheng
    Wang, Pingyu
    Su, Fei
    Meng, Hongying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 5803 - 5816
  • [25] Disentangling Label Distribution for Long-tailed Visual Recognition
    Hong, Youngkyu
    Han, Seungju
    Choi, Kwanghee
    Seo, Seokjun
    Kim, Beomsu
    Chang, Buru
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6622 - 6632
  • [26] Enhanced Long-Tailed Recognition With Contrastive CutMix Augmentation
    Pan, Haolin
    Guo, Yong
    Yu, Mianjie
    Chen, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 4215 - 4230
  • [27] A dual progressive strategy for long-tailed visual recognition
    Hong Liang
    Guoqing Cao
    Mingwen Shao
    Qian Zhang
    Machine Vision and Applications, 2024, 35
  • [28] Class-Balanced Regularization for Long-Tailed Recognition
    Xu, Yuge
    Lyu, Chuanlong
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [29] Rethinking Long-Tailed Visual Recognition with Dynamic Probability Smoothing and Frequency Weighted Focusing
    Nah, Wan Jun
    Ng, Chun Chet
    Lin, Che-Tsung
    Lee, Yeong Khang
    Kew, Jie Long
    Tan, Zhi Qin
    Chan, Chee Seng
    Zach, Christopher
    Lai, Shang-Hong
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 435 - 439
  • [30] Domain Balancing: Face Recognition on Long-Tailed Domains
    Cao, Dong
    Zhu, Xiangyu
    Huang, Xingyu
    Guo, Jianzhu
    Lei, Zhen
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5670 - 5678