Spatial-Temporal Sequence Attention Based Efficient Transformer for Video Snow Removal

被引:0
|
作者
Gao, Tao [1 ]
Zhang, Qianxi [2 ]
Chen, Ting [3 ]
Wen, Yuanbo [3 ]
机构
[1] Changan Univ, Sch Data Sci & Artificial Intelligence, Xian 710064, Peoples R China
[2] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
[3] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
来源
BIG DATA MINING AND ANALYTICS | 2025年 / 8卷 / 03期
基金
中国国家自然科学基金;
关键词
video restoration; vision Transformer; window attention; computer vision; neural representation;
D O I
10.26599/BDMA.2024.9020061
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video snow removal has tremendous potential in enhancing video quality and boosting the performance of computer vision tasks. Recently, Transformers have gained attention for the self-attention mechanism. However, the memory consumption of self-attention is considerable, limiting its application in high-resolution video restoration. In this paper, we propose an efficient video desnowing spatio-temporal Transformer, which utilizes spatio-temporal sequence attention to parallelly capture intra-frame spatial information and inter-frame temporal information, with much lower memory consumption compared to standard self-attention. Additionally, we mitigate the impact of snowflake occlusion on video frame alignment by leveraging an atmospheric scattering model. Furthermore, we introduce the concept of Neural Representation for Videos (NeRV) and effectively reconstruct compressed videos after multi-resolution feature extraction using the recovery NeRV module, thereby further reducing computational consumption. Extensive experiments demonstrate that the model achieves superior performance in video snow removal while significantly reducing computational resources.
引用
收藏
页码:551 / 562
页数:12
相关论文
共 50 条
  • [1] Video Description with Spatial-Temporal Attention
    Tu, Yunbin
    Zhang, Xishan
    Liu, Bingtao
    Yan, Chenggang
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1014 - 1022
  • [2] Scene Retrieval in Soccer Videos by Spatial-temporal Attention with Video Vision Transformer
    Gan, Yaozong
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Mild
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 453 - 454
  • [3] Collaborative spatial-temporal video salient object detection with cross attention transformer
    Su, Yuting
    Wang, Weikang
    Liu, Jing
    Jing, Peiguang
    SIGNAL PROCESSING, 2024, 224
  • [4] Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description
    Shen, Kai
    Wu, Lingfei
    Xu, Fangli
    Tang, Siliang
    Xiao, Jun
    Zhuang, Yueting
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 941 - 947
  • [5] Video Compression Artifacts Removal With Spatial-Temporal Attention-Guided Enhancement
    Jiang, Nanfeng
    Chen, Weiling
    Lin, Jielian
    Zhao, Tiesong
    Lin, Chia-Wen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5657 - 5669
  • [6] SPATIAL-TEMPORAL ATTENTION ANALYSIS FOR HOME VIDEO
    Qiu, Xuekan
    Jiang, Shuqiang
    Liu, Huiying
    Huang, Qingming
    Cao, Longbing
    2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1517 - +
  • [7] Spatial-Temporal Transformer for Video Snapshot Compressive Imaging
    Wang, Lishun
    Cao, Miao
    Zhong, Yong
    Yuan, Xin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 9072 - 9089
  • [8] ShiftFormer: Spatial-Temporal Shift Operation in Video Transformer
    Yang, Beiying
    Zhu, Guibo
    Ge, Guojing
    Luo, Jinzhao
    Wang, Jinqiao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1895 - 1900
  • [9] Unsupervised Low-Light Video Enhancement With Spatial-Temporal Co-Attention Transformer
    Lv, Xiaoqian
    Zhang, Shengping
    Wang, Chenyang
    Zhang, Weigang
    Yao, Hongxun
    Huang, Qingming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4701 - 4715
  • [10] Attention-based spatial-temporal graph transformer for traffic flow forecasting
    Qingyong Zhang
    Wanfeng Chang
    Changwu Li
    Conghui Yin
    Yixin Su
    Peng Xiao
    Neural Computing and Applications, 2023, 35 : 21827 - 21839