End-To-End Compression for Surveillance Video With Unsupervised Foreground-Background Separation

被引:9
|
作者
Zhao, Yu [1 ]
Luo, Dengyan [1 ]
Wang, Fuchun [1 ]
Gao, Han [1 ]
Ye, Mao [1 ]
Zhu, Ce [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Encoding; Surveillance; Video compression; Video coding; Neural networks; Deep learning; Streaming media; foreground-background separation; surveillance video; PREDICTION; CASCADE; FRAMES; HEVC;
D O I
10.1109/TBC.2023.3280039
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the exponential growth of surveillance video, efficient video coding method is in great demand. The learning-based methods emerge which either directly use a general video compression framework, or separate the foreground and background and then compress them in two stages. However, they do not take into account the relatively static background fact of surveillance video, or simply separate foreground and background in offline mode which reduces the separation performance because the temporal domain correlation is not considered very well. In this paper, we propose an end-to-end Unsupervised foreground-background separation based Video Compression neural Networks, dubbed as UVCNet. Our method mainly consists of three parts. First, the Mask Net unsupervisely separates foreground and background online which sufficiently uses the temporal correlation prior. Then, a traditional motion estimation-based residual coding module is applied to foreground compression. Simultaneously, a background compression module is applied to compress background residual and update the background by sufficiently using the relatively static property. Compared with previous approaches, our method does not separate foreground and background in advance but in an end-to-end manner. So we can not only use the relatively static background property to save bit rate, but also achieve end-to-end online video compression. Experimental results demonstrate that the proposed UVCNet achieves superior performance compared with the state-of-the-art methods. Specifically, UVCNet can achieve 2.11 dB average improvement on Peak Signal-to-Noise Ratio (PSNR) compared with H.265 on surveillance datasets.
引用
收藏
页码:966 / 978
页数:13
相关论文
共 50 条
  • [41] An algorithm for foreground-background separation in low quality patrimonial document images
    Mello, Carlos A. B.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 911 - 920
  • [42] Towards End-to-End Compression in Lustre
    Fuchs, Anna
    Squar, Jannek
    Kuhn, Michael
    2024 23RD INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING, ISPDC 2024, 2024,
  • [43] IMPROVING UNSUPERVISED STYLE TRANSFER IN END-TO-END SPEECH SYNTHESIS WITH END-TO-END SPEECH RECOGNITION
    Liu, Da-Rong
    Yang, Chi-Yu
    Wu, Szu-Lin
    Lee, Hung-Yi
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 640 - 647
  • [44] Foreground-background separation transformer for weakly supervised surface defect detection
    Jiang, Xiaoheng
    Feng, Jian
    Yan, Feng
    Lu, Yang
    Fa, Quanhai
    Zhang, Wenjie
    Xu, Mingliang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024,
  • [45] End-To-End Security for Video Distribution
    Boho, Andras
    Van Wallendael, Glenn
    Dooms, Ann
    De Cock, Jan
    Braeckman, Geert
    Schelkens, Peter
    Preneel, Bart
    Van de Walle, Rik
    IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (02) : 97 - 107
  • [46] Retargeting Video With an End-to-End Framework
    Le, Thi-Ngoc-Hanh
    Huang, HuiGuang
    Chen, Yi-Ru
    Lee, Tong-Yee
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6164 - 6176
  • [47] End-to-end Distributed Video Coding
    Zhou, Junwei
    Lv, Ting
    Yi, XiangBo
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 496 - 496
  • [48] End-to-end video background subtraction with 3d convolutional neural networks
    Sakkos, Dimitrios
    Liu, Heng
    Han, Jungong
    Shao, Ling
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 23023 - 23041
  • [49] End-to-end video background subtraction with 3d convolutional neural networks
    Dimitrios Sakkos
    Heng Liu
    Jungong Han
    Ling Shao
    Multimedia Tools and Applications, 2018, 77 : 23023 - 23041
  • [50] Video Multi-Scale-Based End-to-End Rate Control in Deep Contextual Video Compression
    Wei, Lili
    Yang, Zhenglong
    Zhang, Hua
    Liu, Xinyu
    Deng, Weihao
    Zhang, Youchao
    APPLIED SCIENCES-BASEL, 2024, 14 (13):