Object Detection-Based Video Compression

被引:3
|
作者
Kim, Myung-Jun [1 ]
Lee, Yung-Lyul [1 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 09期
基金
新加坡国家研究基金会;
关键词
object detection; video compression; VVC (Versatile Video Coding); video coding application; quantization;
D O I
10.3390/app12094525
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Video compression is designed to provide good subjective image quality, even at a high-compression ratio. In addition, video quality metrics have been used to show the results can maintain a high Peak Signal-to-Noise Ratio (PSNR), even at high compression. However, there are many difficulties in object recognition on the decoder side due to the low image quality caused by high compression. Accordingly, providing good image quality for the detected objects is necessary for the given total bitrate for utilizing object detection in a video decoder. In this paper, object detection-based video compression by the encoder and decoder is proposed that allocates lower quantization parameters to the detected-object regions and higher quantization parameters to the background. Therefore, better image quality is obtained for the detected objects on the decoder side. Object detection-based video compression consists of two types: Versatile Video Coding (VVC) and object detection. In this paper, the decoder performs the decompression process by receiving the bitstreams in the object-detection decoder and the VVC decoder. In the proposed method, the VVC encoder and decoder are processed based on the information obtained from object detection. In a random access (RA) configuration, the average Bjontegaard Delta (BD)-rates of Y, Cb, and Cr increased by 2.33%, 2.67%, and 2.78%, respectively. In an All Intra (AI) configuration, the average BD-rates of Y, Cb, and Cr increased by 0.59%, 1.66%, and 1.42%, respectively. In an RA configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.17%, 0.23%, and 0.04%, respectively. In an AI configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.71%, 0.30%, and 0.30%, respectively. Subjective image quality was also improved in the object-detected areas.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] User Engagement Detection-Based Financial Technology Advertising Video Effectiveness Evaluation
    Gao, Qun
    JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2024, 36 (01)
  • [32] Impact of Video Compression on the Performance of Object Detection Systems for Surveillance Applications
    O'Byrne, Michael
    Sugrue, Mark
    Vibhoothi
    Kokaram, Anil
    2022 18TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS 2022), 2022,
  • [33] Review of the Deep Learning Models for Anomaly Detection-Based Video Scrutiny System
    Kukade, Jyoti
    Panse, Prashant
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 2, CIS 2023, 2024, 869 : 235 - 252
  • [34] Convolutional LSTM Based Video Object Detection
    Wang, Xiao
    Xie, Xiaohua
    Lai, Jianhuang
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 99 - 109
  • [35] Context based object detection from video
    Paletta, L
    Greindl, C
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2003, 2626 : 502 - 512
  • [36] Assessment of object-based video compression for epilepsy monitoring
    Sun, MU
    Liu, Q
    Scheuer, ML
    Sclabassi, RJ
    SECOND JOINT EMBS-BMES CONFERENCE 2002, VOLS 1-3, CONFERENCE PROCEEDINGS: BIOENGINEERING - INTEGRATIVE METHODOLOGIES, NEW TECHNOLOGIES, 2002, : 1045 - 1046
  • [37] Arbitrarily shaped virtual-object based video compression
    Sharma, Naresh
    Zhu, Junda
    Zheng, Yuan F.
    Balster, Eric J.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 62 (03) : 659 - 680
  • [38] Arbitrarily shaped virtual-object based video compression
    Naresh Sharma
    Junda Zhu
    Yuan F. Zheng
    Eric J. Balster
    Multimedia Tools and Applications, 2013, 62 : 659 - 680
  • [39] A research on object-based system for fractal video compression
    Zhu, Shi-Ping
    Wang, Zai-Kuo
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2010, 21 (05): : 725 - 730
  • [40] COMPRESSION NOISE BASED VIDEO FORGERY DETECTION
    Ravi, Hareesh
    Subramanyam, A. V.
    Gupta, Gaurav
    Kumar, B. Avinash
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5352 - 5356