Label-efficient object detection via region proposal network pre-training

被引:3
|
作者
Dong, Nanqing [1 ]
Ericsson, Linus [2 ]
Yang, Yongxin [3 ]
Leonardis, Ales [4 ]
Mcdonagh, Steven [2 ]
机构
[1] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[2] Univ Edinburgh, Inst Imaging Data & Commun IDCOM, Sch Engn, Edinburgh EH9 3FG, Scotland
[3] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[4] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
关键词
Self-supervised learning; Object detection;
D O I
10.1016/j.neucom.2024.127376
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self -supervised pre -training, based on the pretext task of instance discrimination, has fuelled the recent advance in label -efficient object detection. However, existing studies focus on pre -training only a feature extractor network to learn transferable representations for downstream detection tasks. This leads to the necessity of training multiple detection -specific modules from scratch in the fine-tuning phase. We argue that the region proposal network (RPN), a common detection -specific module, can additionally be pre -trained towards reducing the localization error of multi -stage detectors. In this work, we propose a simple pretext task that provides an effective pre -training for the RPN, towards efficiently improving downstream object detection performance. We evaluate the efficacy of our approach on benchmark object detection tasks and additional downstream tasks, including instance segmentation and few -shot detection. In comparison with multi -stage detectors without RPN pre -training, our approach is able to consistently improve downstream task performance, with largest gains found in label -scarce settings.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Masked Autoencoder for Pre-Training on 3D Point Cloud Object Detection
    Xie, Guangda
    Li, Yang
    Qu, Hongquan
    Sun, Zaiming
    MATHEMATICS, 2022, 10 (19)
  • [32] Efficient Conditional Pre-training for Transfer Learning
    Chakraborty, Shuvam
    Uzkent, Burak
    Ayush, Kumar
    Tanmay, Kumar
    Sheehan, Evan
    Ermon, Stefano
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4240 - 4249
  • [33] Multi-label rhinitis prediction using ensemble neural network chain with pre-training
    Yang, Jingdong
    Zhang, Meng
    Liu, Peng
    Yu, Shaoqing
    APPLIED SOFT COMPUTING, 2022, 122
  • [34] KL-DIVERGENCE-BASED REGION PROPOSAL NETWORK FOR OBJECT DETECTION
    Seo, Geonseok
    Yoo, Jaeyoung
    Choi, Jaeseok
    Kwak, Nojun
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2001 - 2005
  • [35] HIERARCHICAL REGION PROPOSAL REFINEMENT NETWORK FOR WEAKLY SUPERVISED OBJECT DETECTION
    Zhang, Ming
    Liu, Shuaicheng
    Zeng, Bing
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 669 - 673
  • [36] UFO RPN: A Region Proposal Network for Ultra Fast Object Detection
    Li, Wenkai
    Song, Andy
    AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 619 - 631
  • [37] RRPN: RADAR REGION PROPOSAL NETWORK FOR OBJECT DETECTION IN AUTONOMOUS VEHICLES
    Nabati, Ramin
    Qi, Hairong
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3093 - 3097
  • [38] Multi-label Patent Classification with Pre-training Model
    Xinyu T.
    Ruijie Z.
    Yonghe L.
    Data Analysis and Knowledge Discovery, 2022, 6 (2-3) : 129 - 137
  • [39] Anchor-free Proposal Generation Network for Efficient Object Detection
    Nguyen, Hoanh
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 327 - 335
  • [40] Improved OOD Generalization via Adversarial Training and Pre-training
    Yi, Mingyangi
    Hou, Lu
    Sun, Jiacheng
    Shang, Lifeng
    Jiang, Xin
    Liu, Qun
    Ma, Zhi-Ming
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139