Cross-Modal and Cross-Domain Knowledge Transfer for Label-Free 3D Segmentation

被引:0
|
作者
Zhang, Jingyu [1 ]
Yang, Huitong [2 ]
Wu, Dai-Jie [2 ]
Keung, Jacky [1 ]
Li, Xuesong [4 ]
Zhu, Xinge [3 ]
Ma, Yuexin [2 ]
机构
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Australian Natl Univ, Coll Sci, Canberra, ACT, Australia
基金
上海市自然科学基金;
关键词
Point Cloud Semantic Segmentation; Unsupervised Domain Adaptation; Cross-modal Transfer Learning;
D O I
10.1007/978-981-99-8435-0_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current state-of-the-art point cloud-based perception methods usually rely on large-scale labeled data, which requires expensive manual annotations. A natural option is to explore the unsupervised methodology for 3D perception tasks. However, such methods often face substantial performance-drop difficulties. Fortunately, we found that there exist amounts of image-based datasets and an alternative can be proposed, i.e., transferring the knowledge in the 2D images to 3D point clouds. Specifically, we propose a novel approach for the challenging cross-modal and cross-domain adaptation task by fully exploring the relationship between images and point clouds and designing effective feature alignment strategies. Without any 3D labels, our method achieves state-of-the-art performance for 3D point cloud semantic segmentation on SemanticKITTI by using the knowledge of KITTI360 and GTA5, compared to existing unsupervised and weakly-supervised baselines.
引用
收藏
页码:465 / 477
页数:13
相关论文
共 50 条
  • [31] Cross-Domain 3D Equivariant Image Embeddings
    Esteves, Carlos
    Sud, Avneesh
    Luo, Zhengyi
    Daniilidis, Kostas
    Makadia, Ameesh
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [32] Supervised Contrastive Learning for 3D Cross-Modal Retrieval
    Choo, Yeon-Seung
    Kim, Boeun
    Kim, Hyun-Sik
    Park, Yong-Suk
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [33] PointAugmenting: Cross-Modal Augmentation for 3D Object Detection
    Wang, Chunwei
    Ma, Chao
    Zhu, Ming
    Yang, Xiaokang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11789 - 11798
  • [34] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
    Yuan, Zhihao
    Yan, Xu
    Liao, Yinghong
    Guo, Yao
    Li, Guanbin
    Cui, Shuguang
    Li, Zhen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8553 - 8563
  • [35] Cross-modal semantic transfer for point cloud semantic segmentation
    Cao, Zhen
    Mi, Xiaoxin
    Qiu, Bo
    Cao, Zhipeng
    Long, Chen
    Yan, Xinrui
    Zheng, Chao
    Dong, Zhen
    Yang, Bisheng
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2025, 221 : 265 - 279
  • [36] ProtoTransfer: Cross-Modal Prototype Transfer for Point Cloud Segmentation
    Tang, Pin
    Xu, Hai-Ming
    Ma, Chao
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3314 - 3324
  • [37] Cross-Domain Recommendation with Cross-Graph Knowledge Transfer Network
    Ouyang, Yi
    Guo, Bin
    Wang, Qianru
    Yu, Zhiwen
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [38] Domain-Oriented Knowledge Transfer for Cross-Domain Recommendation
    Zhao, Guoshuai
    Zhang, Xiaolong
    Tang, Hao
    Shen, Jialie
    Qian, Xueming
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9539 - 9550
  • [39] MCKTNet: Multiscale Cross-Modal Knowledge Transfer Network for Semantic Segmentation of Remote Sensing Images
    Cui, Jian
    Liu, Jiahang
    Ni, Yue
    Sun, Yuan
    Guo, Mao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [40] CC-DA: CROSS-DOMAIN CONSISTENCY DATA AUGMENTATION FOR 3D TUMOR SEGMENTATION
    He, Jiezhou
    Luo, Zhiming
    Peng, Wei
    Su, Songzhi
    Li, Shaozi
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1936 - 1940