Object Detection-Based Video Compression

被引:3
|
作者
Kim, Myung-Jun [1 ]
Lee, Yung-Lyul [1 ]
机构
[1] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 09期
基金
新加坡国家研究基金会;
关键词
object detection; video compression; VVC (Versatile Video Coding); video coding application; quantization;
D O I
10.3390/app12094525
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Video compression is designed to provide good subjective image quality, even at a high-compression ratio. In addition, video quality metrics have been used to show the results can maintain a high Peak Signal-to-Noise Ratio (PSNR), even at high compression. However, there are many difficulties in object recognition on the decoder side due to the low image quality caused by high compression. Accordingly, providing good image quality for the detected objects is necessary for the given total bitrate for utilizing object detection in a video decoder. In this paper, object detection-based video compression by the encoder and decoder is proposed that allocates lower quantization parameters to the detected-object regions and higher quantization parameters to the background. Therefore, better image quality is obtained for the detected objects on the decoder side. Object detection-based video compression consists of two types: Versatile Video Coding (VVC) and object detection. In this paper, the decoder performs the decompression process by receiving the bitstreams in the object-detection decoder and the VVC decoder. In the proposed method, the VVC encoder and decoder are processed based on the information obtained from object detection. In a random access (RA) configuration, the average Bjontegaard Delta (BD)-rates of Y, Cb, and Cr increased by 2.33%, 2.67%, and 2.78%, respectively. In an All Intra (AI) configuration, the average BD-rates of Y, Cb, and Cr increased by 0.59%, 1.66%, and 1.42%, respectively. In an RA configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.17%, 0.23%, and 0.04%, respectively. In an AI configuration, the averages of Delta Y-PSNR, Delta Cb-PSNR, and Delta Cr-PSNR for the object-detected areas improved to 0.71%, 0.30%, and 0.30%, respectively. Subjective image quality was also improved in the object-detected areas.
引用
收藏
页数:18
相关论文
共 50 条
  • [11] Detection-based Object Labeling in 3D Scenes
    Lai, Kevin
    Bo, Liefeng
    Ren, Xiaofeng
    Fox, Dieter
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 1330 - 1337
  • [12] A DETECTION-BASED APPROACH TO BROADCAST NEWS VIDEO STORY SEGMENTATION
    Ma, Chengyuan
    Byun, Byungki
    Kim, Ilseo
    Lee, Chin-Hui
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1957 - 1960
  • [13] Automotive DNN-Based Object Detection in the Presence of Lens Obstruction and Video Compression
    Baris, Gabriele
    Li, Boda
    Chan, Pak Hung
    Avizzano, Carlo Alberto
    Donzella, Valentina
    IEEE ACCESS, 2025, 13 : 36575 - 36589
  • [14] An object detection-based framework for automatically recognizing iStar hand drafts
    Zhu, Yuran
    Li, Tong
    AUTOMATED SOFTWARE ENGINEERING, 2022, 29 (02)
  • [15] Unique Measure for Geometrical Shape Object Detection-based on Area Matching
    Chaudhuri, D.
    Kushwaha, N. K.
    Sharif, I.
    Gohri, V.
    DEFENCE SCIENCE JOURNAL, 2012, 62 (01) : 58 - 66
  • [16] An object detection-based framework for automatically recognizing iStar hand drafts
    Yuran Zhu
    Tong Li
    Automated Software Engineering, 2022, 29
  • [17] Object Detection-based Automatic Waste Segregation using Robotic Arm
    Ibrahim, Azza Elsayed
    Shoitan, Rasha
    Moussa, Mona M.
    Elnemr, Heba A.
    Cho, Young Im
    Abdallah, Mohamed S.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (06) : 912 - 926
  • [18] NCOD: Near-Optimum Video Compression for Object Detection
    Elahi, Ardavan
    Falahati, Ali
    Pakdaman, Farhad
    Modarressi, Mehdi
    Gabbouj, Moncef
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [19] A practical object detection-based multiscale attention strategy for person reidentification
    Zhang, Bin
    Song, Zhenyu
    Huang, Xingping
    Qian, Jin
    Cai, Chengfei
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (12): : 6772 - 6791
  • [20] Object Detection-Based License Plate Localization and Recognition in Complex Environments
    Tao, Ting
    Dong, Decun
    Huang, Shize
    Chen, Wei
    Yang, Lingyu
    TRANSPORTATION RESEARCH RECORD, 2020, 2674 (12) : 212 - 223