Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds

被引:5
|
作者
Yang, Ze [1 ]
Li, Ruibo [1 ]
Ling, Evan [2 ]
Zhang, Chi [1 ]
Wang, Yiming [1 ]
Huang, Dezhao [2 ]
Ma, Keng Teck [2 ]
Hur, Minhoe [3 ]
Lin, Guosheng [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Hyundai Motor Grp Innovat Ctr Singapore HMGICS, Singapore, Singapore
[3] Hyundai Motor Grp, AIRS Co, Singapore, Singapore
关键词
D O I
10.1109/ICCV51070.2023.01705
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual semantic segmentation (CSS) aims to extend an existing model to tackle unseen tasks while retaining its old knowledge. Naively fine-tuning the old model on new data leads to catastrophic forgetting. A common solution is knowledge distillation (KD), where the output distribution of the new model is regularized to be similar to that of the old model. However, in CSS, this is challenging because of the background shift issue. Existing KD-based CSS methods continue to suffer from confusion between the background and novel classes since they fail to establish a reliable class correspondence for distillation. To address this issue, we propose a new label-guided knowledge distillation (LGKD) loss, where the old model output is expanded and transplanted (with the guidance of the ground truth label) to form a semantically appropriate class correspondence with the new model output. Consequently, the useful knowledge from the old model can be effectively distilled into the new model without causing confusion. We conduct extensive experiments on two prevailing CSS benchmarks, Pascal-VOC and ADE20K, where our LGKD significantly boosts the performance of three competing methods, especially on novel mIoU by up to +76%, setting new state-of-the-art. Finally, to further demonstrate its generalization ability, we introduce the first CSS benchmark for 3D point cloud based on ScanNet, along with several re-implemented baselines for comparison. Experiments show that LGKD is versatile in both 2D and 3D modalities without requiring ad hoc design. Codes are available at https://github.com/Ze-Yang/LGKD.
引用
收藏
页码:18555 / 18566
页数:12
相关论文
共 50 条
  • [21] Deep Scene Flow Learning: From 2D Images to 3D Point Clouds
    Xiang, Xuezhi
    Abdein, Rokia
    Li, Wei
    El Saddik, Abdulmotaleb
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (01) : 185 - 208
  • [22] Joint 2D and 3D Semantic Segmentation with Consistent Instance Semantic
    Wan, Yingcai
    Fang, Lijin
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2024, E107A (08) : 1309 - 1318
  • [23] SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks
    Boulch, Alexandre
    Guerry, Yids
    Le Saux, Bertrand
    Audebert, Nicolas
    COMPUTERS & GRAPHICS-UK, 2018, 71 : 189 - 198
  • [24] DPRNet: Deep 3D Point Based Residual Network for Semantic Segmentation and Classification of 3D Point Clouds
    Arshad, Saira
    Shahzad, Muhammad
    Riaz, Qaiser
    Fraz, Muhammad Moazam
    IEEE ACCESS, 2019, 7 : 68892 - 68904
  • [25] Online static point cloud map construction based on 3D point clouds and 2D images
    Chi, Peng
    Liao, Haipeng
    Zhang, Qin
    Wu, Xiangmiao
    Tian, Jiyu
    Wang, Zhenmin
    VISUAL COMPUTER, 2024, 40 (04): : 2889 - 2904
  • [26] Online static point cloud map construction based on 3D point clouds and 2D images
    Peng Chi
    Haipeng Liao
    Qin Zhang
    Xiangmiao Wu
    Jiyu Tian
    Zhenmin Wang
    The Visual Computer, 2024, 40 : 2889 - 2904
  • [27] Knowledge guided object detection and identification in 3D Point Clouds
    Karmacharya, A.
    Boochs, F.
    Tietz, B.
    VIDEOMETRICS, RANGE IMAGING, AND APPLICATIONS XIII, 2015, 9528
  • [28] Semantic Segmentation Networks of 3D Point Clouds for RGB-D Indoor Scenes
    Wang, Ya
    Zell, Andreas
    TWELFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2019), 2020, 11433
  • [29] FAST SEMANTIC SEGMENTATION OF 3D POINT CLOUDS WITH STRONGLY VARYING DENSITY
    Hackel, Timo
    Wegner, Jan D.
    Schindler, Konrad
    XXIII ISPRS CONGRESS, COMMISSION III, 2016, 3 (03): : 177 - 184
  • [30] Hierarchical SVM for Semantic Segmentation of 3D Point Clouds for Infrastructure Scenes
    Mansour, Mohamed
    Martens, Jan
    Blankenbach, Joerg
    INFRASTRUCTURES, 2024, 9 (05)