Label-Guided Knowledge Distillation for Continual Semantic Segmentation on 2D Images and 3D Point Clouds

Cited by: 5

Authors:

Yang, Ze [1]

Li, Ruibo [1]

Ling, Evan [2]

Zhang, Chi [1]

Wang, Yiming [1]

Huang, Dezhao [2]

Ma, Keng Teck [2]

Hur, Minhoe [3]

Lin, Guosheng [1]

Affiliations:

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[2] Hyundai Motor Grp Innovat Ctr Singapore HMGICS, Singapore, Singapore

[3] Hyundai Motor Grp, AIRS Co, Singapore, Singapore

Source:

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023

Keywords:

DOI:

10.1109/ICCV51070.2023.01705

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Continual semantic segmentation (CSS) aims to extend an existing model to tackle unseen tasks while retaining its old knowledge. Naively fine-tuning the old model on new data leads to catastrophic forgetting. A common solution is knowledge distillation (KD), where the output distribution of the new model is regularized to be similar to that of the old model. However, in CSS, this is challenging because of the background shift issue. Existing KD-based CSS methods continue to suffer from confusion between the background and novel classes since they fail to establish a reliable class correspondence for distillation. To address this issue, we propose a new label-guided knowledge distillation (LGKD) loss, where the old model output is expanded and transplanted (with the guidance of the ground truth label) to form a semantically appropriate class correspondence with the new model output. Consequently, the useful knowledge from the old model can be effectively distilled into the new model without causing confusion. We conduct extensive experiments on two prevailing CSS benchmarks, Pascal-VOC and ADE20K, where our LGKD significantly boosts the performance of three competing methods, especially on novel mIoU, by up to +76%, setting a new state of the art. Finally, to further demonstrate its generalization ability, we introduce the first CSS benchmark for 3D point clouds, based on ScanNet, along with several re-implemented baselines for comparison. Experiments show that LGKD is versatile in both 2D and 3D modalities without requiring ad hoc design. Code is available at https://github.com/Ze-Yang/LGKD.
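
The "expand and transplant" idea in the abstract can be illustrated with a small sketch. This is only one possible reading of the abstract, not the authors' implementation (the linked repository is authoritative). It assumes the old model outputs a per-pixel probability vector over the old classes with background at index 0, that novel classes are appended after the old ones, and that on a pixel labeled with a novel class the old background probability is moved into that class's slot. The function names `expand_and_transplant` and `lgkd_loss_sketch` are hypothetical.

```python
import math


def expand_and_transplant(old_probs, label, num_new, bg_index=0):
    """Build a distillation target in the new label space for one pixel.

    old_probs: old-model probabilities over the old classes (assumed layout).
    label: ground-truth class id in the new label space.
    num_new: number of novel classes appended after the old ones.
    """
    num_old = len(old_probs)
    # Expand: novel-class slots start with zero probability.
    target = list(old_probs) + [0.0] * num_new
    # Transplant (assumption): on a novel-class pixel, what the old model
    # called "background" is moved to the labeled novel class.
    if label >= num_old:
        target[label] = target[bg_index]
        target[bg_index] = 0.0
    return target


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - lse for z in logits]


def lgkd_loss_sketch(old_probs_px, new_logits_px, labels, bg_index=0):
    """Mean cross-entropy between transplanted targets and new-model outputs."""
    total = 0.0
    for old_p, new_z, y in zip(old_probs_px, new_logits_px, labels):
        num_new = len(new_z) - len(old_p)
        target = expand_and_transplant(old_p, y, num_new, bg_index)
        log_p = log_softmax(new_z)
        total += -sum(t * lp for t, lp in zip(target, log_p))
    return total / len(labels)
```

Under these assumptions the target still sums to one, and a new model that predicts the novel class on a novel-class pixel incurs a lower loss than one that predicts background there, which is the correspondence the abstract describes.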


Pages: 18555-18566

Page count: 12
