Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation

Cited by: 0
Authors
Zhang, Jingyu [1]
Yang, Huitong [2]
Wu, Dai-Jie [2]
Keung, Jacky [1]
Li, Xuesong [4]
Zhu, Xinge [3]
Ma, Yuexin [2]
Affiliations
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
Funding
Natural Science Foundation of Shanghai;
Keywords
Point Cloud Semantic Segmentation; Unsupervised Domain Adaptation; Cross-modal Transfer Learning;
DOI
10.1007/978-981-99-8435-0_37
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Current state-of-the-art point cloud-based perception methods usually rely on large-scale labeled data, which requires expensive manual annotation. A natural option is to explore unsupervised methods for 3D perception tasks. However, such methods often suffer substantial performance drops. Fortunately, large numbers of image-based datasets exist, which suggests an alternative: transferring knowledge from 2D images to 3D point clouds. Specifically, we propose a novel approach to the challenging cross-modal and cross-domain adaptation task by fully exploring the relationship between images and point clouds and designing effective feature alignment strategies. Without any 3D labels, our method achieves state-of-the-art performance for 3D point cloud semantic segmentation on SemanticKITTI by using the knowledge of KITTI360 and GTA5, compared to existing unsupervised and weakly-supervised baselines.
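The abstract does not spell out how images and point clouds are related. As a rough illustration only, and not the authors' method, one common way to link the two modalities is to project each 3D point into a calibrated camera image and read off the label of the pixel it lands on. The sketch below assumes a simple pinhole camera, points already expressed in the camera frame, and a per-pixel label map; the function name `transfer_pixel_labels` and all of its parameters are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def transfer_pixel_labels(
    points_cam: Sequence[Tuple[float, float, float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    label_img: Sequence[Sequence[int]],
    ignore: int = -1,
) -> List[int]:
    """Copy 2D per-pixel labels onto 3D points via pinhole projection.

    points_cam: (x, y, z) points in the camera frame, z pointing forward.
    fx, fy, cx, cy: assumed pinhole intrinsics (focal lengths, principal point).
    label_img: row-major label map indexed as label_img[v][u].
    Points behind the camera or outside the image receive `ignore`.
    """
    height = len(label_img)
    width = len(label_img[0])
    labels: List[int] = []
    for x, y, z in points_cam:
        if z <= 0:
            # The point is behind the image plane, so it has no valid pixel.
            labels.append(ignore)
            continue
        # Perspective division followed by the intrinsic mapping.
        u = math.floor(fx * x / z + cx)
        v = math.floor(fy * y / z + cy)
        if 0 <= u < width and 0 <= v < height:
            labels.append(label_img[v][u])
        else:
            labels.append(ignore)
    return labels


if __name__ == "__main__":
    # Toy 3x3 label map and a handful of points, purely for illustration.
    toy_labels = [[0, 1, 2], [3, 4, 5], [6, 7, 8]]
    toy_points = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (0.0, 0.0, -1.0)]
    print(transfer_pixel_labels(toy_points, 1.0, 1.0, 1.0, 1.0, toy_labels))
```

Labels obtained this way are only as good as the 2D predictions and the calibration, which is presumably one reason the abstract emphasizes feature alignment rather than direct label copying.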
Pages: 465-477
Number of pages: 13