Label-efficient object detection via region proposal network pre-training

被引:3
|
作者
Dong, Nanqing [1 ]
Ericsson, Linus [2 ]
Yang, Yongxin [3 ]
Leonardis, Ales [4 ]
Mcdonagh, Steven [2 ]
机构
[1] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[2] Univ Edinburgh, Inst Imaging Data & Commun IDCOM, Sch Engn, Edinburgh EH9 3FG, Scotland
[3] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[4] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
关键词
Self-supervised learning; Object detection;
D O I
10.1016/j.neucom.2024.127376
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self -supervised pre -training, based on the pretext task of instance discrimination, has fuelled the recent advance in label -efficient object detection. However, existing studies focus on pre -training only a feature extractor network to learn transferable representations for downstream detection tasks. This leads to the necessity of training multiple detection -specific modules from scratch in the fine-tuning phase. We argue that the region proposal network (RPN), a common detection -specific module, can additionally be pre -trained towards reducing the localization error of multi -stage detectors. In this work, we propose a simple pretext task that provides an effective pre -training for the RPN, towards efficiently improving downstream object detection performance. We evaluate the efficacy of our approach on benchmark object detection tasks and additional downstream tasks, including instance segmentation and few -shot detection. In comparison with multi -stage detectors without RPN pre -training, our approach is able to consistently improve downstream task performance, with largest gains found in label -scarce settings.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] LETCP: A Label-Efficient Transformer-Based Contrastive Pre-Training Method for Brain Tumor Segmentation
    Chen, Shoucun
    Zhang, Jing
    Zhang, Tianchi
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [2] Point-Level Region Contrast for Object Detection Pre-Training
    Bai, Yutong
    Chen, Xinlei
    Kirillov, Alexander
    Yuille, Alan
    Berg, Alexander C.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16040 - 16049
  • [3] Label-Efficient Online Continual Object Detection in Streaming Video
    Wu, Jay Zhangjie
    Zhang, David Junhao
    Hsu, Wynne
    Zhang, Mengmi
    Shou, Mike Zheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19189 - 19198
  • [4] Geospatial Object Detection via Deconvolutional Region Proposal Network
    Wang, Chen
    Shi, Jun
    Yang, Xiaqing
    Zhou, Yuanyuan
    Wei, Shunjun
    Li, Liang
    Zhang, Xiaoling
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (08) : 3014 - 3027
  • [5] Label-Efficient Video Object Segmentation With Motion Clues
    Lu, Yawen
    Zhang, Jie
    Sun, Su
    Guo, Qianyu
    Cao, Zhiwen
    Fei, Songlin
    Yang, Baijian
    Chen, Yingjie Victor
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 6710 - 6721
  • [6] Understanding the Effects of Pre-Training for Object Detectors via Eigenspectrum
    Shinya, Yosuke
    Simo-Serra, Edgar
    Suzuki, Taiji
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1931 - 1941
  • [7] Disentangled Pre-training for Human-Object Interaction Detection
    Li, Zhuolong
    Li, Xingao
    Ding, Changxing
    Xu, Xiangmin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 28191 - 28201
  • [8] DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
    Yao, Lewei
    Han, Jianhua
    Liang, Xiaodan
    Xu, Dan
    Zhang, Wei
    Li, Zhenguo
    Xu, Hang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23497 - 23506
  • [9] Label-efficient Segmentation via Affinity Propagation
    Li, Wentong
    Yuan, Yuqian
    Wang, Song
    Liu, Wenyu
    Tang, Dongqi
    Liu, Jian
    Zhu, Jianke
    Zhang, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Label-Efficient 3D Object Detection For Road-Side Units
    Dao, Minh-Quan
    Caesar, Holger
    Berrio, Julie Stephany
    Shan, Mao
    Worrall, Stewart
    Fremont, Vincent
    Malis, Ezio
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1572 - 1579