COLLABORATIVE SPATIAL-TEMPORAL DISTILLATION FOR EFFICIENT VIDEO DERAINING

被引:0
|
作者
Hu, Yuzhang [1 ]
Liu, Minghao [1 ]
Yang, Wenhan [2 ]
Liu, Jiaying [1 ]
Guo, Zongming [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Video Deraining; Knowledge Distillation; Spatial Alignment; Temporal Alignment; Spatial-Temporal Adaptor; RAIN;
D O I
10.1109/ICME55011.2023.00332
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel knowledge distillation framework to improve the efficiency of deep networks for video deraining. The knowledge is transferred from a large-scale powerful teacher network to a compact efficient student network via the proposed collaborative spatial-temporal distillation framework. The framework is equipped with three collaboration schemes of different granularities that make use of spatial-temporal redundancy in a complementary way for better distillation performance. First, the spatial alignment module applies distillation constraints at different spatial scales to achieve better scale invariance in transferred knowledge. Second, the temporal alignment module traces both temporal status between teacher and student separately and collaboratively, to comprehensively utilize inter-frame information. Third, these two alignment modules interact through a spatial-temporal adaptor, where spatial-temporal knowledge is transferred in a unified framework. Extensive experiments demonstrate the superiority of our distillation framework as well as the effectiveness of each module. Our code is available at: https://github.com/HuYuzhang/Knowledge-Distillation.
引用
收藏
页码:1937 / 1942
页数:6
相关论文
共 50 条
  • [31] A spatial-temporal approach for video caption detection and recognition
    Tang, X
    Gao, XB
    Liu, JZ
    Zhang, HJ
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 961 - 971
  • [32] Using Spatial-Temporal Attention for Video Quality Evaluation
    Chi, Biwei
    Su, Ruifang
    Chen, Xinhui
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2024, 2024
  • [33] Enhanced spatial-temporal freedom for video frame interpolation
    Li, Hao-Dong
    Yin, Hui
    Liu, Zhi-Hao
    Huang, Hua
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10535 - 10547
  • [34] STAT: Spatial-Temporal Attention Mechanism for Video Captioning
    Yan, Chenggang
    Tu, Yunbin
    Wang, Xingzheng
    Zhang, Yongbing
    Hao, Xinhong
    Zhang, Yongdong
    Dai, Qionghai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) : 229 - 241
  • [35] A new spatial-temporal representation structure of symbolic video
    Yu, Ping
    ICIC Express Letters, 2011, 5 (11): : 4013 - 4019
  • [36] Spatial-temporal features for smoke detections on video images
    Ma, Li
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1284 - 1291
  • [37] Video Scene Graph Generation with Spatial-Temporal Knowledge
    Pu, Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
  • [38] Contrast Based Hierarchical Spatial-Temporal Saliency for Video
    Le, Trung-Nghia
    Sugimoto, Akihiro
    IMAGE AND VIDEO TECHNOLOGY, PSIVT 2015, 2016, 9431 : 734 - 748
  • [39] Video Quality Assessment Based on Spatial-temporal Distortion
    Yang, Chunting
    Liu, Yang
    Yu, Jing
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL I, 2009, : 818 - +
  • [40] Video Object Detection with an Aligned Spatial-Temporal Memory
    Xiao, Fanyi
    Lee, Yong Jae
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 494 - 510