Object Detection-Based Video Compression

被引：3

作者：

Kim, Myung-Jun ^{[1
]}

Lee, Yung-Lyul ^{[1
]}

机构：

[1] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea

来源：

APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 09期

基金：

新加坡国家研究基金会;

关键词：

object detection; video compression; VVC (Versatile Video Coding); video coding application; quantization;

D O I：

10.3390/app12094525

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Video compression is designed to provide good subjective image quality, even at a high-compression ratio. In addition, video quality metrics have been used to show the results can maintain a high Peak Signal-to-Noise Ratio (PSNR), even at high compression. However, there are many difficulties in object recognition on the decoder side due to the low image quality caused by high compression. Accordingly, providing good image quality for the detected objects is necessary for the given total bitrate for utilizing object detection in a video decoder. In this paper, object detection-based video compression by the encoder and decoder is proposed that allocates lower quantization parameters to the detected-object regions and higher quantization parameters to the background. Therefore, better image quality is obtained for the detected objects on the decoder side. Object detection-based video compression consists of two types: Versatile Video Coding (VVC) and object detection. In this paper, the decoder performs the decompression process by receiving the bitstreams in the object-detection decoder and the VVC decoder. In the proposed method, the VVC encoder and decoder are processed based on the information obtained from object detection. In a random access (RA) configuration, the average Bjontegaard Delta (BD)-rates of Y, Cb, and Cr increased by 2.33%, 2.67%, and 2.78%, respectively. In an All Intra (AI) configuration, the average BD-rates of Y, Cb, and Cr increased by 0.59%, 1.66%, and 1.42%, respectively. In an RA configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.17%, 0.23%, and 0.04%, respectively. In an AI configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.71%, 0.30%, and 0.30%, respectively. Subjective image quality was also improved in the object-detected areas.

引用

页数：18

共 50 条

[31] User Engagement Detection-Based Financial Technology Advertising Video Effectiveness Evaluation
Gao, Qun
JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2024, 36 (01)
[32] Impact of Video Compression on the Performance of Object Detection Systems for Surveillance Applications
O'Byrne, Michael
Sugrue, Mark
Vibhoothi
Kokaram, Anil
2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
[33] Review of the Deep Learning Models for Anomaly Detection-Based Video Scrutiny System
Kukade, Jyoti
Panse, Prashant
FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 2, CIS 2023, 2024, 869 : 235 - 252
[34] Convolutional LSTM Based Video Object Detection
Wang, Xiao
Xie, Xiaohua
Lai, Jianhuang
PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 99 - 109
[35] Context based object detection from video
Paletta, L
Greindl, C
COMPUTER VISION SYSTEMS, PROCEEDINGS, 2003, 2626 : 502 - 512
[36] Assessment of object-based video compression for epilepsy monitoring
Sun, MU
Liu, Q
Scheuer, ML
Sclabassi, RJ
SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 1045 - 1046
[37] Arbitrarily shaped virtual-object based video compression
Sharma, Naresh
Zhu, Junda
Zheng, Yuan F.
Balster, Eric J.
MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 62 (03) : 659 - 680
[38] Arbitrarily shaped virtual-object based video compression
Naresh Sharma
Junda Zhu
Yuan F. Zheng
Eric J. Balster
Multimedia Tools and Applications, 2013, 62 : 659 - 680
[39] A research on object-based system for fractal video compression
Zhu, Shi-Ping
Wang, Zai-Kuo
Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2010, 21 (05): : 725 - 730
[40] COMPRESSION NOISE BASED VIDEO FORGERY DETECTION
Ravi, Hareesh
Subramanyam, A. V.
Gupta, Gaurav
Kumar, B. Avinash
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5352 - 5356

← 1 2 3 4 5 →