Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds

被引:5
|
作者
Yang, Ze [1 ]
Li, Ruibo [1 ]
Ling, Evan [2 ]
Zhang, Chi [1 ]
Wang, Yiming [1 ]
Huang, Dezhao [2 ]
Ma, Keng Teck [2 ]
Hur, Minhoe [3 ]
Lin, Guosheng [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Hyundai Motor Grp Innovat Ctr Singapore HMGICS, Singapore, Singapore
[3] Hyundai Motor Grp, AIRS Co, Singapore, Singapore
关键词
D O I
10.1109/ICCV51070.2023.01705
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual semantic segmentation (CSS) aims to extend an existing model to tackle unseen tasks while retaining its old knowledge. Naively fine-tuning the old model on new data leads to catastrophic forgetting. A common solution is knowledge distillation (KD), where the output distribution of the new model is regularized to be similar to that of the old model. However, in CSS, this is challenging because of the background shift issue. Existing KD-based CSS methods continue to suffer from confusion between the background and novel classes since they fail to establish a reliable class correspondence for distillation. To address this issue, we propose a new label-guided knowledge distillation (LGKD) loss, where the old model output is expanded and transplanted (with the guidance of the ground truth label) to form a semantically appropriate class correspondence with the new model output. Consequently, the useful knowledge from the old model can be effectively distilled into the new model without causing confusion. We conduct extensive experiments on two prevailing CSS benchmarks, Pascal-VOC and ADE20K, where our LGKD significantly boosts the performance of three competing methods, especially on novel mIoU by up to +76%, setting new state-of-the-art. Finally, to further demonstrate its generalization ability, we introduce the first CSS benchmark for 3D point cloud based on ScanNet, along with several re-implemented baselines for comparison. Experiments show that LGKD is versatile in both 2D and 3D modalities without requiring ad hoc design. Codes are available at https://github.com/Ze-Yang/LGKD.
引用
收藏
页码:18555 / 18566
页数:12
相关论文
共 50 条
  • [1] 2D TO 3D LABEL PROPAGATION FOR THE SEMANTIC SEGMENTATION OF HERITAGE BUILDING POINT CLOUDS
    Pellis, E.
    Murtiyoso, A.
    Masiero, A.
    Tucci, G.
    Betti, M.
    Grussenmeyer, P.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 861 - 867
  • [2] SEMANTIC ENRICHMENT OF 3D POINT CLOUDS USING 2D IMAGE SEGMENTATION
    Rai, A.
    Srivastava, N.
    Khoshelham, K.
    Jain, K.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1659 - 1666
  • [3] An Integrated Model for Generating 3D Point Clouds and Segmentation from 2D Images
    Chen, Huan-Yu
    Lin, Chuen-Horng
    Lin, Yan-Yu
    Chan, Yung-Kuan
    2024 8TH INTERNATIONAL CONFERENCE ON IMAGING, SIGNAL PROCESSING AND COMMUNICATIONS, ICISPC 2024, 2024, : 94 - 101
  • [4] SEGCloud: Semantic Segmentation of 3D Point Clouds
    Tchapmi, Lyne P.
    Choy, Christopher B.
    Armeni, Iro
    Gwak, JunYoung
    Savarese, Silvio
    PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2017, : 537 - 547
  • [5] 3D Can Be Explored In 2D: Pseudo-Label Generation for LiDAR Point Clouds Using Sensor-Intensity-Based 2D Semantic Segmentation
    Caunes, Andrew
    Chateau, Thierry
    Fremont, Vincent
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 2192 - 2197
  • [6] Point attention network for semantic segmentation of 3D point clouds
    Feng, Mingtao
    Zhang, Liang
    Lin, Xuefei
    Gilani, Syed Zulqarnain
    Mian, Ajmal
    PATTERN RECOGNITION, 2020, 107 (107)
  • [7] GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds
    Zhang, Zihui
    Yang, Bo
    Wang, Bing
    Li, Bo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17619 - 17629
  • [8] Superpoint-guided Semi-supervised Semantic Segmentation of 3D Point Clouds
    Deng, Shuang
    Dong, Qiulei
    Liu, Bo
    Hu, Zhanyi
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 9214 - 9220
  • [9] 3D-MiniNet: Learning a 2D Representation From Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation
    Alonso, Inigo
    Riazuelo, Luis
    Montesano, Luis
    Murillo, Ana C.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 5432 - 5439
  • [10] Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images
    Wei, Zizhuang
    Wang, Yao
    Yi, Hongwei
    Chen, Yisong
    Wang, Guoping
    APPLIED SCIENCES-BASEL, 2020, 10 (04):