Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation

被引：0

作者：

Zhang, Jingyu ^{[1
]}

Yang, Huitong ^{[2
]}

Wu, Dai-Jie ^{[2
]}

Keung, Jacky ^{[1
]}

Li, Xuesong ^{[4
]}

Zhu, Xinge ^{[3
]}

Ma, Yuexin ^{[2
]}

机构：

[1] City Univ Hong Kong, Hong Kong, Peoples R China

[2] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[4] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III | 2024年 / 14427卷

基金：

上海市自然科学基金;

关键词：

Point Cloud Semantic Segmentation; Unsupervised Domain Adaptation; Cross-modal Transfer Learning;

D O I：

10.1007/978-981-99-8435-0_37

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current state-of-the-art point cloud-based perception methods usually rely on large-scale labeled data, which requires expensive manual annotations. A natural option is to explore the unsupervised methodology for 3D perception tasks. However, such methods often face substantial performance-drop difficulties. Fortunately, we found that there exist amounts of image-based datasets and an alternative can be proposed, i.e., transferring the knowledge in the 2D images to 3D point clouds. Specifically, we propose a novel approach for the challenging cross-modal and cross-domain adaptation task by fully exploring the relationship between images and point clouds and designing effective feature alignment strategies. Without any 3D labels, our method achieves state-of-the-art performance for 3D point cloud semantic segmentation on SemanticKITTI by using the knowledge of KITTI360 and GTA5, compared to existing unsupervised and weakly-supervised baselines.

引用

页码：465 / 477

页数：13

共 50 条

[1] Cross-Domain and Cross-Modal Knowledge Distillation in Domain Adaptation for 3D Semantic Segmentation
Li, Miaoyu
Zhang, Yachao
Xie, Yuan
Gao, Zuodong
Li, Cuihua
Zhang, Zhizhong
Qu, Yanyun
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3829 - 3837
[2] Cross-domain Cross-modal Food Transfer
Zhu, Bin
Ngo, Chong-Wah
Chen, Jing-jing
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3762 - 3770
[3] Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval
Li, Fengling
Wang, Bowen
Zhu, Lei
Li, Jingjing
Zhang, Zheng
Chang, Xiaojun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9664 - 9677
[4] Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation
Jaritz, Maximilian
Vu, Tuan-Hung
de Charette, Raoul
Wirbel, Émilie
Pérez, Patrick
arXiv, 2021,
[5] Cross-Modal Learning for Domain Adaptation in 3D Semantic Segmentation
Jaritz, Maximilian
Tuan-Hung Vu
de Charette, Raoul
Wirbel, Emilie
Perez, Patrick
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1533 - 1544
[6] Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation
Chen, Yiyang
Zhao, Shanshan
Ding, Changxing
Tang, Liyao
Wang, Chaoyue
Tao, Dacheng
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3866 - 3875
[7] Cross-Modal Contrastive Learning for Domain Adaptation in 3D Semantic Segmentation
Xing, Bowei
Ying, Xianghua
Wang, Ruibin
Yang, Jinfa
Chen, Taiyan
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2974 - 2982
[8] CAFA: Cross-Modal Attentive Feature Alignment for Cross-Domain Urban Scene Segmentation
Liu, Peng
Ge, Yanqi
Duan, Lixin
Li, Wen
Lv, Fengmao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (10) : 11666 - 11675
[9] Cross-Modal Center Loss for 3D Cross-Modal Retrieval
Jing, Longlong
Vahdani, Elahe
Tan, Jiaxing
Tian, Yingli
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3141 - 3150
[10] Cross-domain Knowledge Transfer Schemes for 3D Human Action Recognition
Psaltis, Athanasios
Papadopoulos, Georgios Th
Daras, Petros
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,

← 1 2 3 4 5 →