Adaptation via Proxy: Building Instance-Aware Proxy for Unsupervised Domain Adaptive 3D Object Detection

被引:2
|
作者
Li, Ziyu [1 ,2 ]
Yao, Yuncong [1 ,2 ]
Quan, Zhibin [1 ,2 ]
Qi, Lei [3 ]
Feng, Zhen-Hua [4 ]
Yang, Wankou [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, England
来源
基金
中国国家自然科学基金;
关键词
Object detection; intelligent vehicle perception; domain adaptation; point cloud; instance-aware; unsupervised learning; autonomous vehicles;
D O I
10.1109/TIV.2023.3343878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D detection task plays a crucial role in the perception system of intelligent vehicles. LiDAR-based 3D detectors perform well on particular autonomous driving benchmarks, but may poorly generalize to other domains. Existing 3D domain adaptive detection methods usually require annotation-related statistics or continuous refinement of pseudo-labels. The former is not always feasible for practical applications, while the latter lacks sufficient accurate supervision. In this work, we propose a novel unsupervised domain adaptive framework, namely Adaptation Via Proxy (AVP), that explicitly leverages cross-domain relationships to generate adequate high-quality samples, thus mitigating domain shifts for existing LiDAR-based 3D detectors. Specifically, we first train the detector on source domain with the curriculum example mining (CEM) strategy to enhance its generalization capability. Then, we integrate the profitable instance knowledge from the source domain with the contextual information from the target domain, to construct the instance-aware proxy, which is a data collection with diverse training scenes and stronger supervision. Finally, we fine-tune the pre-trained detector on the proxy data for further optimizing the detector to overcome domain gaps. To build the instance-aware proxy, two components are proposed, i.e., the multi-view multi-scale aggregation (MMA) method for producing high-quality pseudo-labels, and the hybrid instance augmentation (HIA) technique for integrating the knowledge from source annotations to enhance supervision. Note that AVP is architecture-agnostic thus it can be easily injected with any LiDAR-based 3D detectors. Extensive experiments on Waymo, nuScenes, KITTI and Lyft demonstrate the superiority of the proposed method over the state-of-the-art approaches for different adaptation scenarios.
引用
收藏
页码:3478 / 3492
页数:15
相关论文
共 50 条
  • [21] Pseudo Label Refinery for Unsupervised Domain Adaptation on Cross-dataset 3D Object Detection
    Zhang, Zhanwei
    Chen, Minghao
    Xiao, Shuai
    Peng, Liang
    la Li, Hen
    Lin, Binbin
    Li, Ping
    Wang, Wenxiao
    Wu, Boxi
    Cai, Deng
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15291 - 15300
  • [22] Domain Adaptive Subterranean 3D Pedestrian Detection via Instance Transfer and Confidence Guidance
    Liu, Zengyun
    Zheng, Zexun
    Qin, Tianyi
    Xu, Liying
    Zhang, Xu
    ELECTRONICS, 2024, 13 (05)
  • [23] Exploiting Playbacks in Unsupervised Domain Adaptation for 3D Object Detection in Self-Driving Cars
    You, Yurong
    Diaz-Ruiz, Carlos Andres
    Wang, Yan
    Chao, Wei-Lun
    Hariharan, Bharath
    Campbell, Mark
    Weinbergert, Kilian Q.
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 5070 - 5077
  • [24] 3D Object Trajectory Reconstruction using Instance-Aware Multibody Structure from Motion and Stereo Sequence Constraints
    Bullinger, Sebastian
    Bodensteiner, Christoph
    Arens, Michael
    Stiefelhagen, Rainer
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 466 - 473
  • [25] ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution
    Ngo, Tuan Duc
    Hua, Binh-Son
    Nguyen, Khoi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13550 - 13559
  • [26] 3D Domain Adaptive Instance Segmentation via Cyclic Segmentation GANs
    Lauenburg, Leander
    Lin, Zudi
    Zhang, Ruihan
    dos Santos, Marcia
    Huang, Siyu
    Arganda-Carreras, Ignacio
    Boyden, Edward S.
    Pfister, Hanspeter
    Wei, Donglai
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (08) : 4018 - 4027
  • [27] Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency
    Zhou, Xingyi
    Karpur, Arjun
    Gan, Chuang
    Luo, Linjie
    Huang, Qixing
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 141 - 157
  • [28] MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection
    Zhang, Chi
    Chen, Wenbo
    Wang, Wei
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6227 - 6240
  • [29] Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection
    Hu, Qianjiang
    Liu, Daizong
    Hu, Wei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17556 - 17566
  • [30] Stereo 3D object detection via instance depth prior guidance and adaptive spatial feature aggregation
    Ji, Chaofeng
    Liu, Guizhong
    Zhao, Dan
    VISUAL COMPUTER, 2023, 39 (10): : 4543 - 4554