Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

Authors
Zhang, Hongquan [1, 2, 3]
Gao, Bin-Bin [2]
Zeng, Yi [2]
Tian, Xudong [1, 3]
Tan, Xin [1, 3]
Zhang, Zhizhong [1, 3]
Qu, Yanyun [4]
Liu, Jun [2]
Xie, Yuan [1, 3]
Affiliations
[1] East China Normal University, Shanghai, People's Republic of China
[2] Tencent YouTu Lab, Shenzhen, Guangdong, People's Republic of China
[3] East China Normal University, Chongqing Institute, Shanghai, People's Republic of China
[4] Xiamen University, Xiamen, People's Republic of China
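Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract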
Class-incremental object detection (CIOD) is a capability desired in real-world applications: an object detector must continuously adapt to new tasks without forgetting previously learned ones, and the main challenge is catastrophic forgetting. Many methods based on distillation and replay have been proposed to alleviate this problem. However, they typically learn on a purely visual backbone and neglect the representational power of textual cues, which limits their performance. In this paper, we propose a task-aware language-image representation to mitigate catastrophic forgetting, introducing a new paradigm for language-image-based CIOD. First, we demonstrate the significant advantage of language-image detectors in mitigating catastrophic forgetting. Second, we propose a method for learning task-aware language-image representations that overcomes the drawback of using a language-image detector directly for CIOD. More specifically, in the training stage we learn the language-image representations of different tasks through an insulating approach, and in the inference stage we use the alignment scores produced by the task-specific language-image representations. With the proposed method, language-image detectors become more practical for CIOD. We conduct extensive experiments on COCO 2017 and Pascal VOC 2007 and demonstrate that the proposed method achieves state-of-the-art results under various CIOD settings.
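The inference step described in the abstract, scoring a region against class text embeddings using task-specific representations, can be illustrated with a toy sketch. This is not the authors' implementation: the per-task element-wise `scale` vector standing in for a task-specific representation, the three-dimensional embeddings, and the names `project` and `alignment_scores` are all assumptions made for illustration. The training-stage insulation is only hinted at by keeping each task's parameters in a separate record.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def project(feature, scale):
    """Hypothetical task-specific adapter: element-wise scaling of a region feature."""
    return [a * s for a, s in zip(feature, scale)]


def alignment_scores(region_feature, tasks):
    """Score one region against every class, using each task's own parameters.

    Each task keeps its own adapter and its own class text embeddings, so
    adding a new task does not modify the records of earlier tasks.
    """
    scores = {}
    for task in tasks:
        rep = project(region_feature, task["scale"])
        for name, text_embedding in task["classes"].items():
            scores[name] = cosine(rep, text_embedding)
    return scores


# Toy data (assumed): two incremental tasks with made-up embeddings.
tasks = [
    {"scale": [1.0, 1.0, 0.5],
     "classes": {"cat": [1.0, 0.0, 0.0], "dog": [0.0, 1.0, 0.0]}},
    {"scale": [0.5, 0.5, 1.0],
     "classes": {"bus": [0.0, 0.0, 1.0]}},
]

scores = alignment_scores([0.9, 0.1, 0.2], tasks)
best = max(scores, key=scores.get)
print(best, scores)
```

In this sketch the prediction is the class with the highest alignment score across all tasks; how the paper combines or calibrates scores from different tasks is not stated in the abstract.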
Pages: 7096-7104
Page count: 9