Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation

Cited by: 0
Authors
Zhang, Jingyu [1]
Yang, Huitong [2]
Wu, Dai-Jie [2]
Keung, Jacky [1]
Li, Xuesong [4]
Zhu, Xinge [3]
Ma, Yuexin [2]
Affiliations
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
Funding
Natural Science Foundation of Shanghai;
Keywords
Point Cloud Semantic Segmentation; Unsupervised Domain Adaptation; Cross-modal Transfer Learning;
DOI
10.1007/978-981-99-8435-0_37
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Current state-of-the-art point cloud-based perception methods usually rely on large-scale labeled data, which requires expensive manual annotation. A natural alternative is to explore unsupervised methods for 3D perception tasks. However, such methods often suffer a substantial drop in performance. Fortunately, a large number of image-based datasets exist, which suggests another option: transferring knowledge from 2D images to 3D point clouds. Specifically, we propose a novel approach for the challenging cross-modal and cross-domain adaptation task that fully explores the relationship between images and point clouds and designs effective feature alignment strategies. Without any 3D labels, our method achieves state-of-the-art performance for 3D point cloud semantic segmentation on SemanticKITTI by using knowledge from KITTI360 and GTA5, compared with existing unsupervised and weakly-supervised baselines.
Pages: 465-477
Number of pages: 13
Related papers
50 in total
  • [1] Cross-Domain and Cross-Modal Knowledge Distillation in Domain Adaptation for 3D Semantic Segmentation
    Li, Miaoyu
    Zhang, Yachao
    Xie, Yuan
    Gao, Zuodong
    Li, Cuihua
    Zhang, Zhizhong
    Qu, Yanyun
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3829 - 3837
  • [2] Cross-domain Cross-modal Food Transfer
    Zhu, Bin
    Ngo, Chong-Wah
    Chen, Jing-jing
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3762 - 3770
  • [3] Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval
    Li, Fengling
    Wang, Bowen
    Zhu, Lei
    Li, Jingjing
    Zhang, Zheng
    Chang, Xiaojun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9664 - 9677
  • [4] Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation
    Jaritz, Maximilian
    Vu, Tuan-Hung
    de Charette, Raoul
    Wirbel, Émilie
    Pérez, Patrick
    arXiv, 2021,
  • [5] Cross-Modal Learning for Domain Adaptation in 3D Semantic Segmentation
    Jaritz, Maximilian
    Vu, Tuan-Hung
    de Charette, Raoul
    Wirbel, Emilie
    Perez, Patrick
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 1533 - 1544
  • [6] Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation
    Chen, Yiyang
    Zhao, Shanshan
    Ding, Changxing
    Tang, Liyao
    Wang, Chaoyue
    Tao, Dacheng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3866 - 3875
  • [7] Cross-Modal Contrastive Learning for Domain Adaptation in 3D Semantic Segmentation
    Xing, Bowei
    Ying, Xianghua
    Wang, Ruibin
    Yang, Jinfa
    Chen, Taiyan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2974 - 2982
  • [8] CAFA: Cross-Modal Attentive Feature Alignment for Cross-Domain Urban Scene Segmentation
    Liu, Peng
    Ge, Yanqi
    Duan, Lixin
    Li, Wen
    Lv, Fengmao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (10) : 11666 - 11675
  • [9] Cross-Modal Center Loss for 3D Cross-Modal Retrieval
    Jing, Longlong
    Vahdani, Elahe
    Tan, Jiaxing
    Tian, Yingli
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3141 - 3150
  • [10] Cross-domain Knowledge Transfer Schemes for 3D Human Action Recognition
    Psaltis, Athanasios
    Papadopoulos, Georgios Th
    Daras, Petros
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,