Adaptation via Proxy: Building Instance-Aware Proxy for Unsupervised Domain Adaptive 3D Object Detection

Cited by: 2

Authors:

Li, Ziyu [1,2]

Yao, Yuncong [1,2]

Quan, Zhibin [1,2]

Qi, Lei [3]

Feng, Zhen-Hua [4]

Yang, Wankou [1,2]

Affiliations:

[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

[2] Southeast Univ, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China

[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China

[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, England

Source:

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024 / Vol. 9 / No. 02

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection; intelligent vehicle perception; domain adaptation; point cloud; instance-aware; unsupervised learning; autonomous vehicles;

DOI:

10.1109/TIV.2023.3343878

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The 3D detection task plays a crucial role in the perception system of intelligent vehicles. LiDAR-based 3D detectors perform well on particular autonomous driving benchmarks, but may generalize poorly to other domains. Existing 3D domain adaptive detection methods usually require annotation-related statistics or continuous refinement of pseudo-labels. The former is not always feasible in practical applications, while the latter lacks sufficiently accurate supervision. In this work, we propose a novel unsupervised domain adaptive framework, namely Adaptation Via Proxy (AVP), that explicitly leverages cross-domain relationships to generate sufficient high-quality samples, thus mitigating domain shifts for existing LiDAR-based 3D detectors. Specifically, we first train the detector on the source domain with the curriculum example mining (CEM) strategy to enhance its generalization capability. Then, we integrate the useful instance knowledge from the source domain with the contextual information from the target domain to construct the instance-aware proxy, which is a data collection with diverse training scenes and stronger supervision. Finally, we fine-tune the pre-trained detector on the proxy data to further optimize it and overcome domain gaps. To build the instance-aware proxy, two components are proposed: the multi-view multi-scale aggregation (MMA) method for producing high-quality pseudo-labels, and the hybrid instance augmentation (HIA) technique for integrating the knowledge from source annotations to enhance supervision. Note that AVP is architecture-agnostic, so it can be easily integrated into any LiDAR-based 3D detector. Extensive experiments on Waymo, nuScenes, KITTI and Lyft demonstrate the superiority of the proposed method over state-of-the-art approaches in different adaptation scenarios.
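The abstract describes the instance-aware proxy only at a high level, so the following is a minimal illustrative sketch rather than the authors' implementation. It assumes a proxy scene can be formed by keeping confident target pseudo-labels and pasting non-overlapping annotated source instances into the target point cloud. All names (`Instance`, `ProxyScene`, `bev_overlap`, `build_proxy_scene`), the score threshold, and the axis-aligned overlap test are hypothetical choices made for illustration.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Instance:
    """One labeled object (hypothetical layout, not taken from the paper)."""
    box: tuple           # (cx, cy, cz, dx, dy, dz, yaw)
    points: list         # list of (x, y, z) tuples inside the box
    score: float = 1.0   # 1.0 stands for a ground-truth source annotation
    origin: str = "source"


@dataclass
class ProxyScene:
    """A training scene mixing target context with source instances."""
    points: list
    labels: list = field(default_factory=list)


def bev_overlap(a, b):
    """Axis-aligned bird's-eye-view overlap test; ignores yaw (a simplification)."""
    return abs(a[0] - b[0]) * 2 < a[3] + b[3] and abs(a[1] - b[1]) * 2 < a[4] + b[4]


def build_proxy_scene(target_points, pseudo_labels, source_bank,
                      score_thresh=0.6, max_paste=2, rng=None):
    """Assumed recipe: filter pseudo-labels by score, then paste source instances."""
    rng = rng or random.Random(0)
    # Keep only confident pseudo-labels predicted on the target scene.
    kept = [p for p in pseudo_labels if p.score >= score_thresh]
    scene = ProxyScene(points=list(target_points), labels=list(kept))
    candidates = list(source_bank)
    rng.shuffle(candidates)
    pasted = 0
    for inst in candidates:
        if pasted >= max_paste:
            break
        # Skip source instances that would collide with an existing label.
        if any(bev_overlap(inst.box, other.box) for other in scene.labels):
            continue
        scene.points.extend(inst.points)
        scene.labels.append(inst)
        pasted += 1
    return scene
```

In this sketch the resulting scene keeps the target background while its labels combine filtered pseudo-labels with exact source annotations, which is one plausible reading of "stronger supervision"; a detector would then be fine-tuned on such scenes.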


Pages: 3478 - 3492

Page count: 15
