COLLABORATIVE SPATIAL-TEMPORAL DISTILLATION FOR EFFICIENT VIDEO DERAINING

被引：0

作者：

Hu, Yuzhang ^{[1
]}

Liu, Minghao ^{[1
]}

Yang, Wenhan ^{[2
]}

Liu, Jiaying ^{[1
]}

Guo, Zongming ^{[1
]}

机构：

[1] Peking Univ, Beijing, Peoples R China

[2] Peng Cheng Lab, Shenzhen, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023年

基金：

中国国家自然科学基金;

关键词：

Video Deraining; Knowledge Distillation; Spatial Alignment; Temporal Alignment; Spatial-Temporal Adaptor; RAIN;

D O I：

10.1109/ICME55011.2023.00332

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose a novel knowledge distillation framework to improve the efficiency of deep networks for video deraining. The knowledge is transferred from a large-scale powerful teacher network to a compact efficient student network via the proposed collaborative spatial-temporal distillation framework. The framework is equipped with three collaboration schemes of different granularities that make use of spatial-temporal redundancy in a complementary way for better distillation performance. First, the spatial alignment module applies distillation constraints at different spatial scales to achieve better scale invariance in transferred knowledge. Second, the temporal alignment module traces both temporal status between teacher and student separately and collaboratively, to comprehensively utilize inter-frame information. Third, these two alignment modules interact through a spatial-temporal adaptor, where spatial-temporal knowledge is transferred in a unified framework. Extensive experiments demonstrate the superiority of our distillation framework as well as the effectiveness of each module. Our code is available at: https://github.com/HuYuzhang/Knowledge-Distillation.

引用

页码：1937 / 1942

页数：6

共 50 条

[31] A spatial-temporal approach for video caption detection and recognition
Tang, X
Gao, XB
Liu, JZ
Zhang, HJ
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 961 - 971
[32] Using Spatial-Temporal Attention for Video Quality Evaluation
Chi, Biwei
Su, Ruifang
Chen, Xinhui
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
[33] Enhanced spatial-temporal freedom for video frame interpolation
Li, Hao-Dong
Yin, Hui
Liu, Zhi-Hao
Huang, Hua
APPLIED INTELLIGENCE, 2023, 53 (09) : 10535 - 10547
[34] STAT: Spatial-Temporal Attention Mechanism for Video Captioning
Yan, Chenggang
Tu, Yunbin
Wang, Xingzheng
Zhang, Yongbing
Hao, Xinhong
Zhang, Yongdong
Dai, Qionghai
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) : 229 - 241
[35] A new spatial-temporal representation structure of symbolic video
Yu, Ping
ICIC Express Letters, 2011, 5 (11): : 4013 - 4019
[36] Spatial-temporal features for smoke detections on video images
Ma, Li
PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1284 - 1291
[37] Video Scene Graph Generation with Spatial-Temporal Knowledge
Pu, Tao
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
[38] Contrast Based Hierarchical Spatial-Temporal Saliency for Video
Le, Trung-Nghia
Sugimoto, Akihiro
IMAGE AND VIDEO TECHNOLOGY, PSIVT 2015, 2016, 9431 : 734 - 748
[39] Video Quality Assessment Based on Spatial-temporal Distortion
Yang, Chunting
Liu, Yang
Yu, Jing
PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL I, 2009, : 818 - +
[40] Video Object Detection with an Aligned Spatial-Temporal Memory
Xiao, Fanyi
Lee, Yong Jae
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 494 - 510

← 1 2 3 4 5 →