Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation

被引：0

作者：

Zhang, Jingyu ^{[1
]}

Yang, Huitong ^{[2
]}

Wu, Dai-Jie ^{[2
]}

Keung, Jacky ^{[1
]}

Li, Xuesong ^{[4
]}

Zhu, Xinge ^{[3
]}

Ma, Yuexin ^{[2
]}

机构：

[1] City Univ Hong Kong, Hong Kong, Peoples R China

[2] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[4] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III | 2024年 / 14427卷

基金：

上海市自然科学基金;

关键词：

Point Cloud Semantic Segmentation; Unsupervised Domain Adaptation; Cross-modal Transfer Learning;

D O I：

10.1007/978-981-99-8435-0_37

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current state-of-the-art point cloud-based perception methods usually rely on large-scale labeled data, which requires expensive manual annotations. A natural option is to explore the unsupervised methodology for 3D perception tasks. However, such methods often face substantial performance-drop difficulties. Fortunately, we found that there exist amounts of image-based datasets and an alternative can be proposed, i.e., transferring the knowledge in the 2D images to 3D point clouds. Specifically, we propose a novel approach for the challenging cross-modal and cross-domain adaptation task by fully exploring the relationship between images and point clouds and designing effective feature alignment strategies. Without any 3D labels, our method achieves state-of-the-art performance for 3D point cloud semantic segmentation on SemanticKITTI by using the knowledge of KITTI360 and GTA5, compared to existing unsupervised and weakly-supervised baselines.

引用

页码：465 / 477

页数：13

共 50 条

[21] Cross-modal 3D Shape Generation and Manipulation
Cheng, Zezhou
Chai, Menglei
Ren, Jian
Lee, Hsin-Ying
Olszewski, Kyle
Huang, Zeng
Maji, Subhransu
Tulyakov, Sergey
COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 303 - 321
[22] LabelDistill: Label-Guided Cross-Modal Knowledge Distillation for Camera-Based 3D Object Detection
Kim, Sanmin
Kim, Youngseok
Hwang, Sihwan
Jeong, Hyeonjun
Kum, Dongsuk
COMPUTER VISION - ECCV 2024, PT LVI, 2025, 15114 : 19 - 37
[23] ERP evidence for temporal differences between cross-modal and cross-domain analogical reasoning
Zhao, Yanqun
Guo, Jiajia
Li, Yangzhuo
Wu, Yuedong
Luo, Junlong
BEHAVIOURAL BRAIN RESEARCH, 2024, 470
[24] Cross-Domain 3D Model Retrieval Based On Contrastive Learning and Label Propagation
Song, Dan
Yang, Yue
Nie, Weizhi
Li, Xuanya
Liu, An-An
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
[25] Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation
Peng, Duo
Lei, Yinjie
Li, Wen
Zhang, Pingping
Guo, Yulan
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7088 - 7097
[26] Multiple Knowledge Transfer for Cross-Domain Recommendation
Do, Quan
Verma, Sunny
Chen, Fang
Liu, Wei
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 529 - 542
[27] Graph Enabled Cross-Domain Knowledge Transfer
Yao, Shibo
ProQuest Dissertations and Theses Global, 2022,
[28] Robust Navigation with Cross-Modal Fusion and Knowledge Transfer
Cai, Wenzhe
Cheng, Guangran
Kong, Lingyue
Dong, Lu
Sun, Changyin
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10233 - 10239
[29] Undoing the Damage of Label Shift for Cross-domain Semantic Segmentation
Liu, Yahao
Deng, Jinhong
Tao, Jiale
Chu, Tong
Duan, Lixin
Li, Wen
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 7032 - 7042
[30] Universal Cross-Domain 3D Model Retrieval
Song, Dan
Li, Tian-Bao
Li, Wen-Hui
Nie, Wei-Zhi
Liu, Wu
Liu, An-An
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2721 - 2731

← 1 2 3 4 5 →