Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation

被引:0
|
作者
Zhang, Jingyu [1 ]
Yang, Huitong [2 ]
Wu, Dai-Jie [2 ]
Keung, Jacky [1 ]
Li, Xuesong [4 ]
Zhu, Xinge [3 ]
Ma, Yuexin [2 ]
机构
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
基金
上海市自然科学基金;
关键词
Point Cloud Semantic Segmentation; Unsupervised Domain Adaptation; Cross-modal Transfer Learning;
D O I
10.1007/978-981-99-8435-0_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current state-of-the-art point cloud-based perception methods usually rely on large-scale labeled data, which requires expensive manual annotations. A natural option is to explore the unsupervised methodology for 3D perception tasks. However, such methods often face substantial performance-drop difficulties. Fortunately, we found that there exist amounts of image-based datasets and an alternative can be proposed, i.e., transferring the knowledge in the 2D images to 3D point clouds. Specifically, we propose a novel approach for the challenging cross-modal and cross-domain adaptation task by fully exploring the relationship between images and point clouds and designing effective feature alignment strategies. Without any 3D labels, our method achieves state-of-the-art performance for 3D point cloud semantic segmentation on SemanticKITTI by using the knowledge of KITTI360 and GTA5, compared to existing unsupervised and weakly-supervised baselines.
引用
收藏
页码:465 / 477
页数:13
相关论文
共 50 条
  • [21] Cross-modal 3D Shape Generation and Manipulation
    Cheng, Zezhou
    Chai, Menglei
    Ren, Jian
    Lee, Hsin-Ying
    Olszewski, Kyle
    Huang, Zeng
    Maji, Subhransu
    Tulyakov, Sergey
    COMPUTER VISION - ECCV 2022, PT III, 2022, 13663 : 303 - 321
  • [22] LabelDistill: Label-Guided Cross-Modal Knowledge Distillation for Camera-Based 3D Object Detection
    Kim, Sanmin
    Kim, Youngseok
    Hwang, Sihwan
    Jeong, Hyeonjun
    Kum, Dongsuk
    COMPUTER VISION - ECCV 2024, PT LVI, 2025, 15114 : 19 - 37
  • [23] ERP evidence for temporal differences between cross-modal and cross-domain analogical reasoning
    Zhao, Yanqun
    Guo, Jiajia
    Li, Yangzhuo
    Wu, Yuedong
    Luo, Junlong
    BEHAVIOURAL BRAIN RESEARCH, 2024, 470
  • [24] Cross-Domain 3D Model Retrieval Based On Contrastive Learning and Label Propagation
    Song, Dan
    Yang, Yue
    Nie, Weizhi
    Li, Xuanya
    Liu, An-An
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [25] Sparse-to-dense Feature Matching: Intra and Inter domain Cross-modal Learning in Domain Adaptation for 3D Semantic Segmentation
    Peng, Duo
    Lei, Yinjie
    Li, Wen
    Zhang, Pingping
    Guo, Yulan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7088 - 7097
  • [26] Multiple Knowledge Transfer for Cross-Domain Recommendation
    Do, Quan
    Verma, Sunny
    Chen, Fang
    Liu, Wei
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 529 - 542
  • [27] Graph Enabled Cross-Domain Knowledge Transfer
    Yao, Shibo
    ProQuest Dissertations and Theses Global, 2022,
  • [28] Robust Navigation with Cross-Modal Fusion and Knowledge Transfer
    Cai, Wenzhe
    Cheng, Guangran
    Kong, Lingyue
    Dong, Lu
    Sun, Changyin
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10233 - 10239
  • [29] Undoing the Damage of Label Shift for Cross-domain Semantic Segmentation
    Liu, Yahao
    Deng, Jinhong
    Tao, Jiale
    Chu, Tong
    Duan, Lixin
    Li, Wen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 7032 - 7042
  • [30] Universal Cross-Domain 3D Model Retrieval
    Song, Dan
    Li, Tian-Bao
    Li, Wen-Hui
    Nie, Wei-Zhi
    Liu, Wu
    Liu, An-An
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2721 - 2731