End-To-End Compression for Surveillance Video With Unsupervised Foreground-Background Separation

被引：9

作者：

Zhao, Yu ^{[1
]}

Luo, Dengyan ^{[1
]}

Wang, Fuchun ^{[1
]}

Gao, Han ^{[1
]}

Ye, Mao ^{[1
]}

Zhu, Ce ^{[2
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China

来源：

IEEE TRANSACTIONS ON BROADCASTING | 2023年 / 69卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Encoding; Surveillance; Video compression; Video coding; Neural networks; Deep learning; Streaming media; foreground-background separation; surveillance video; PREDICTION; CASCADE; FRAMES; HEVC;

D O I：

10.1109/TBC.2023.3280039

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the exponential growth of surveillance video, efficient video coding method is in great demand. The learning-based methods emerge which either directly use a general video compression framework, or separate the foreground and background and then compress them in two stages. However, they do not take into account the relatively static background fact of surveillance video, or simply separate foreground and background in offline mode which reduces the separation performance because the temporal domain correlation is not considered very well. In this paper, we propose an end-to-end Unsupervised foreground-background separation based Video Compression neural Networks, dubbed as UVCNet. Our method mainly consists of three parts. First, the Mask Net unsupervisely separates foreground and background online which sufficiently uses the temporal correlation prior. Then, a traditional motion estimation-based residual coding module is applied to foreground compression. Simultaneously, a background compression module is applied to compress background residual and update the background by sufficiently using the relatively static property. Compared with previous approaches, our method does not separate foreground and background in advance but in an end-to-end manner. So we can not only use the relatively static background property to save bit rate, but also achieve end-to-end online video compression. Experimental results demonstrate that the proposed UVCNet achieves superior performance compared with the state-of-the-art methods. Specifically, UVCNet can achieve 2.11 dB average improvement on Peak Signal-to-Noise Ratio (PSNR) compared with H.265 on surveillance datasets.

引用

页码：966 / 978

页数：13

共 50 条

[31] Foreground-background separation and deblurring super-resolution method☆
Liu, Xuebin
Chen, Yuang
Zhao, Chongji
Yang, Jie
Deng, Huan
OPTICS AND LASERS IN ENGINEERING, 2025, 184
[32] MPNET: An End-to-End Deep Neural Network for Object Detection in Surveillance Video
Wang, Hanyu
Wang, Ping
Qian, Xueming
IEEE ACCESS, 2018, 6 : 30296 - 30308
[33] End-to-End Learning of Video Compression Using Spatio-Temporal Autoencoders
Pessoa, Jorge
Aidos, Helena
Tomas, Pedro
Figueiredo, Mario A. T.
2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 276 - 281
[34] FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space
Hu, Zhihao
Xu, Dong
Lu, Guo
Jiang, Wei
Wang, Wei
Liu, Shan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4569 - 4585
[35] End-to-End Image Patch Quality Assessment for Image/Video With Compression Artifacts
Tung Thanh Pham
Xiem Van Hoang
Nghia Trung Nguyen
Duong Trieu Dinh
Le Thanh Ha
IEEE ACCESS, 2020, 8 : 215157 - 215172
[36] Learning-based End-to-End Video Compression Using Predictive Coding
de Oliveira, Matheus C.
Martins, Luiz G. R.
Jung, Henrique Costa
Guerin Jr, Nilson Donizete
da Silva, Renam Castro
Peixoto, Eduardo
Macchiavello, Bruno
Hung, Edson M.
Testoni, Vanessa
Freitas, Pedro Garcia
2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 160 - 167
[37] Video Foreground-Background Separation via Weighted Schatten-p Norm and Structured Sparsity Decomposition
Wei Yufeng
Jing Mingli
Li Lan
Sun Kun
Fan Ruibo
LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (08)
[38] An Adaptive Foreground-Background Separation Method for Effective Binarization of Document Images
Das, Bishwadeep
Bhowmik, Showmik
Saha, Aniruddha
Sarkar, Ram
PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 515 - 524
[39] UNSUPERVISED MODEL ADAPTATION FOR END-TO-END ASR
Sivaraman, Ganesh
Casal, Ricardo
Garland, Matt
Khoury, Elie
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6987 - 6991
[40] TOWARDS END-TO-END UNSUPERVISED SPEECH RECOGNITION
Liu, Alexander H.
Hsu, Wei-Ning
Auli, Michael
Baevski, Alexei
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 221 - 228

← 1 2 3 4 5 →