Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

被引:0
|
作者
Zhang, Hongquan [1 ,2 ,3 ]
Gao, Bin-Bin [2 ]
Zeng, Yi [2 ]
Tian, Xudong [1 ,3 ]
Tan, Xin [1 ,3 ]
Zhang, Zhizhong [1 ,3 ]
Qu, Yanyun [4 ]
Liu, Jun [2 ]
Xie, Yuan [1 ,3 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Tencent YouTu Lab, Shenzhen, Guangdong, Peoples R China
[3] East China Normal Univ, Chongqing Inst, Shanghai, Peoples R China
[4] Xiamen Univ, Xiamen, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class-incremental object detection (CIOD) is a real-world desired capability, requiring an object detector to continuously adapt to new tasks without forgetting learned ones, with the main challenge being catastrophic forgetting. Many methods based on distillation and replay have been proposed to alleviate this problem. However, they typically learn on a pure visual backbone, neglecting the powerful representation capabilities of textual cues, which to some extent limits their performance. In this paper, we propose task-aware language-image representation to mitigate catastrophic forgetting, introducing a new paradigm for language-image-based CIOD. First of all, we demonstrate the significant advantage of language-image detectors in mitigating catastrophic forgetting. Secondly, we propose a learning task-aware language-image representation method that overcomes the existing drawback of directly utilizing the language-image detector for CIOD. More specifically, we learn the language-image representation of different tasks through an insulating approach in the training stage, while using the alignment scores produced by task-specific language-image representation in the inference stage. Through our proposed method, language-image detectors can be more practical for CIOD. We conduct extensive experiments on COCO 2017 and Pascal VOC 2007 and demonstrate that the proposed method achieves state-of-the-art results under the various CIOD settings.
引用
收藏
页码:7096 / 7104
页数:9
相关论文
共 50 条
  • [1] Class-incremental object detection
    Dong, Na
    Zhang, Yongqiang
    Ding, Mingli
    Bai, Yancheng
    PATTERN RECOGNITION, 2023, 139
  • [2] Probing Image Compression For Class-Incremental Learning
    Yang, Justin
    Duan, Zhihao
    Peng, Andrew
    Huang, Yuning
    He, Jiangpeng
    Zhu, Fengqing
    2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [3] Few-Shot Class-Incremental Learning for Classification and Object Detection: A Survey
    Zhang, Jinghua
    Liu, Li
    Silven, Olli
    Pietikainen, Matti
    Hu, Dewen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2924 - 2945
  • [4] Dynamic Task Subspace Ensemble for Class-Incremental Learning
    Zhang, Weile
    He, Yuanjian
    Cong, Yulai
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 322 - 334
  • [5] Task-aware asynchronous multi-task model with class incremental contrastive learning for surgical scene understanding
    Lalithkumar Seenivasan
    Mobarakol Islam
    Mengya Xu
    Chwee Ming Lim
    Hongliang Ren
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 921 - 928
  • [6] Class-Incremental Learning Based on Anomaly Detection
    Zhang, Lijuan
    Yang, Xiaokang
    Zhang, Kai
    Li, Yong
    Li, Fu
    Li, Jun
    Li, Dongming
    IEEE ACCESS, 2023, 11 : 69423 - 69438
  • [7] Task-aware asynchronous multi-task model with class incremental contrastive learning for surgical scene understanding
    Seenivasan, Lalithkumar
    Islam, Mobarakol
    Xu, Mengya
    Lim, Chwee Ming
    Ren, Hongliang
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (05) : 921 - 928
  • [8] Task-aware image quality estimators for face detection
    Singh, Praneet
    Reibman, Amy R.
    Eurasip Journal on Image and Video Processing, 2024, 2024 (01)
  • [9] Enhancing class-incremental object detection in remote sensing through instance-aware distillation
    Feng, Hangtao
    Zhang, Lu
    Yang, Xu
    Liu, Zhiyong
    NEUROCOMPUTING, 2024, 583
  • [10] TARDRL: Task-Aware Reconstruction for Dynamic Representation Learning of fMRI
    Zhao, Yunxi
    Nie, Dong
    Chen, Geng
    Wu, Xia
    Zhang, Daoqiang
    Wen, Xuyun
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XI, 2024, 15011 : 700 - 710