End-To-End Compression for Surveillance Video With Unsupervised Foreground-Background Separation

被引:9
|
作者
Zhao, Yu [1 ]
Luo, Dengyan [1 ]
Wang, Fuchun [1 ]
Gao, Han [1 ]
Ye, Mao [1 ]
Zhu, Ce [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Encoding; Surveillance; Video compression; Video coding; Neural networks; Deep learning; Streaming media; foreground-background separation; surveillance video; PREDICTION; CASCADE; FRAMES; HEVC;
D O I
10.1109/TBC.2023.3280039
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the exponential growth of surveillance video, efficient video coding method is in great demand. The learning-based methods emerge which either directly use a general video compression framework, or separate the foreground and background and then compress them in two stages. However, they do not take into account the relatively static background fact of surveillance video, or simply separate foreground and background in offline mode which reduces the separation performance because the temporal domain correlation is not considered very well. In this paper, we propose an end-to-end Unsupervised foreground-background separation based Video Compression neural Networks, dubbed as UVCNet. Our method mainly consists of three parts. First, the Mask Net unsupervisely separates foreground and background online which sufficiently uses the temporal correlation prior. Then, a traditional motion estimation-based residual coding module is applied to foreground compression. Simultaneously, a background compression module is applied to compress background residual and update the background by sufficiently using the relatively static property. Compared with previous approaches, our method does not separate foreground and background in advance but in an end-to-end manner. So we can not only use the relatively static background property to save bit rate, but also achieve end-to-end online video compression. Experimental results demonstrate that the proposed UVCNet achieves superior performance compared with the state-of-the-art methods. Specifically, UVCNet can achieve 2.11 dB average improvement on Peak Signal-to-Noise Ratio (PSNR) compared with H.265 on surveillance datasets.
引用
收藏
页码:966 / 978
页数:13
相关论文
共 50 条
  • [31] Foreground-background separation and deblurring super-resolution method☆
    Liu, Xuebin
    Chen, Yuang
    Zhao, Chongji
    Yang, Jie
    Deng, Huan
    OPTICS AND LASERS IN ENGINEERING, 2025, 184
  • [32] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
    Wang, Hanyu
    Wang, Ping
    Qian, Xueming
    IEEE ACCESS, 2018, 6 : 30296 - 30308
  • [33] End-to-End Learning of Video Compression Using Spatio-Temporal Autoencoders
    Pessoa, Jorge
    Aidos, Helena
    Tomas, Pedro
    Figueiredo, Mario A. T.
    2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 276 - 281
  • [34] FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space
    Hu, Zhihao
    Xu, Dong
    Lu, Guo
    Jiang, Wei
    Wang, Wei
    Liu, Shan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4569 - 4585
  • [35] End-to-End Image Patch Quality Assessment for Image/Video With Compression Artifacts
    Tung Thanh Pham
    Xiem Van Hoang
    Nghia Trung Nguyen
    Duong Trieu Dinh
    Le Thanh Ha
    IEEE ACCESS, 2020, 8 : 215157 - 215172
  • [36] Learning-based End-to-End Video Compression Using Predictive Coding
    de Oliveira, Matheus C.
    Martins, Luiz G. R.
    Jung, Henrique Costa
    Guerin Jr, Nilson Donizete
    da Silva, Renam Castro
    Peixoto, Eduardo
    Macchiavello, Bruno
    Hung, Edson M.
    Testoni, Vanessa
    Freitas, Pedro Garcia
    2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 160 - 167
  • [37] Video Foreground-Background Separation via Weighted Schatten-p Norm and Structured Sparsity Decomposition
    Wei Yufeng
    Jing Mingli
    Li Lan
    Sun Kun
    Fan Ruibo
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
  • [38] An Adaptive Foreground-Background Separation Method for Effective Binarization of Document Images
    Das, Bishwadeep
    Bhowmik, Showmik
    Saha, Aniruddha
    Sarkar, Ram
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 515 - 524
  • [39] UNSUPERVISED MODEL ADAPTATION FOR END-TO-END ASR
    Sivaraman, Ganesh
    Casal, Ricardo
    Garland, Matt
    Khoury, Elie
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6987 - 6991
  • [40] TOWARDS END-TO-END UNSUPERVISED SPEECH RECOGNITION
    Liu, Alexander H.
    Hsu, Wei-Ning
    Auli, Michael
    Baevski, Alexei
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 221 - 228