Adaptation via Proxy: Building Instance-Aware Proxy for Unsupervised Domain Adaptive 3D Object Detection

被引:2
|
作者
Li, Ziyu [1 ,2 ]
Yao, Yuncong [1 ,2 ]
Quan, Zhibin [1 ,2 ]
Qi, Lei [3 ]
Feng, Zhen-Hua [4 ]
Yang, Wankou [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, England
来源
基金
中国国家自然科学基金;
关键词
Object detection; intelligent vehicle perception; domain adaptation; point cloud; instance-aware; unsupervised learning; autonomous vehicles;
D O I
10.1109/TIV.2023.3343878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D detection task plays a crucial role in the perception system of intelligent vehicles. LiDAR-based 3D detectors perform well on particular autonomous driving benchmarks, but may poorly generalize to other domains. Existing 3D domain adaptive detection methods usually require annotation-related statistics or continuous refinement of pseudo-labels. The former is not always feasible for practical applications, while the latter lacks sufficient accurate supervision. In this work, we propose a novel unsupervised domain adaptive framework, namely Adaptation Via Proxy (AVP), that explicitly leverages cross-domain relationships to generate adequate high-quality samples, thus mitigating domain shifts for existing LiDAR-based 3D detectors. Specifically, we first train the detector on source domain with the curriculum example mining (CEM) strategy to enhance its generalization capability. Then, we integrate the profitable instance knowledge from the source domain with the contextual information from the target domain, to construct the instance-aware proxy, which is a data collection with diverse training scenes and stronger supervision. Finally, we fine-tune the pre-trained detector on the proxy data for further optimizing the detector to overcome domain gaps. To build the instance-aware proxy, two components are proposed, i.e., the multi-view multi-scale aggregation (MMA) method for producing high-quality pseudo-labels, and the hybrid instance augmentation (HIA) technique for integrating the knowledge from source annotations to enhance supervision. Note that AVP is architecture-agnostic thus it can be easily injected with any LiDAR-based 3D detectors. Extensive experiments on Waymo, nuScenes, KITTI and Lyft demonstrate the superiority of the proposed method over the state-of-the-art approaches for different adaptation scenarios.
引用
收藏
页码:3478 / 3492
页数:15
相关论文
共 50 条
  • [1] Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery
    Grinvald, Margarita
    Furrer, Fadri
    Novkovic, Tonci
    Chung, Jen Jen
    Cadena, Cesar
    Siegwart, Roland
    Nieto, Juan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03) : 3037 - 3044
  • [2] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection
    Wang, Yuqi
    Chen, Yuntao
    Zhang, Zhaoxiang
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5096 - 5105
  • [3] Unsupervised Domain Adaptation for 3D Object Detection via Self-Training
    Luo, Di
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II, 2024, 14426 : 307 - 318
  • [4] SPG: Unsupervised Domain Adaptation for 3D Object Detection via Semantic Point Generation
    Xu, Qiangeng
    Zhou, Yin
    Wang, Weiyue
    Qi, Charles R.
    Anguelov, Dragomir
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15426 - 15436
  • [5] Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-training
    Li, Zhenyu
    Chen, Zehui
    Li, Ang
    Fang, Liangji
    Jiang, Qinhong
    Liu, Xianming
    Jiang, Junjun
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 245 - 262
  • [6] INSTANCE-AWARE SIMPLIFICATION OF 3D POLYGONAL MESHES
    Azim, Tahir
    Cheslack-Postava, Ewen
    Levis, Philip
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [7] Instance-aware sampling and voxel-transformer encoding for single-stage 3D object detection
    Wang, Baotong
    Xia, Chenxing
    Gao, Xiuju
    Yang, Yuan
    Li, Kuan-Ching
    Fang, Xianjin
    Zhang, Yan
    Ge, Sijia
    DIGITAL SIGNAL PROCESSING, 2025, 162
  • [8] PLS: UNSUPERVISED DOMAIN ADAPTATION FOR 3D OBJECT DETECTION VIA PSEUDO-LABEL SIZES
    Chen, Shijie
    Wang, Rongquan
    Li, Xin
    Wu, Yuchen
    Liu, Haizhuang
    Chen, Jiansheng
    Ma, Huimin
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 6370 - 6374
  • [9] Unsupervised Subcategory Domain Adaptive Network for 3D Object Detection in LiDAR
    Wang, Zhiyu
    Wang, Li
    Xiao, Liang
    Dai, Bin
    ELECTRONICS, 2021, 10 (08)
  • [10] Instance-Aware Monocular 3D Semantic Scene Completion
    Xiao, Haihong
    Xu, Hongbin
    Kang, Wenxiong
    Li, Yuqiong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (07) : 6543 - 6554