WCDANN: A Lightweight CNN Post-Processing Filter for VVC-Based Video Compression

被引:2
|
作者
Zhang, Hao [1 ]
Jung, Cheolkon [1 ]
Zou, Dan [2 ]
Li, Ming [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Guangdong OPPO Mobile Telecommun Corp, Dongguan 523860, Peoples R China
基金
中国国家自然科学基金;
关键词
Video compression; attention; convolutional neural network; depthwise separable convolution; in-loop filter; post-processing;
D O I
10.1109/ACCESS.2023.3301145
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a weakly connected dense attention neural network for compression artifact removal, called WCDANN. WCDANN is a convolutional neural network (CNN)-based post-processing filter to enhance the quality of versatile video coding (VVC)-decoded videos without requiring any codec changes. WCDANN consists of several weakly connected dense attention blocks (WCDABs) based on residual learning, which takes the compressed video after codecs as the input. We use depthwise separable convolution for WCDANN as the basic convolution unit to generate a lightweight model. Moreover, we introduce attention mechanisms into the proposed filter to capture important features. Experimental results show that WCDANN achieves good performance in Bjontegaard Delta Bit Rate (BD-BR). Compared with VTM-11.0-NNVC anchor, WCDANN achieves average 2.81%, 4.12% and 3.81% BD-rate reductions for Y channel on A1, A2, B, C, D and E classes in RA, AI and LDP configurations, respectively.
引用
收藏
页码:83400 / 83413
页数:14
相关论文
共 50 条
  • [31] An automatic facial beautification method for video post-processing
    Zhou, Yifeng
    Tian, Wensha
    Yu, Chengrong
    Jiang, Bo
    Liu, Sijiang
    THIRD INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2018, 10828
  • [32] Video coding with super-resolution post-processing
    Kondo, Satoshi
    Toma, Tadamasa
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 3141 - +
  • [33] The application of EPLD on HDTV video signal post-processing
    Jiang, LH
    Liu, YH
    Chai, ZM
    Liu, G
    CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (03): : 401 - 405
  • [34] Robust and efficient post-processing for video object detection
    Sabater, Alberto
    Montesano, Luis
    Murillo, Ana C.
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10536 - 10542
  • [35] Overcoming the practical restrictions in H.266/VVC-based video communication systems by a PI bit rate controller
    Farhad Raufmehr
    Mohammad Reza Salehi
    Ebrahim Abiri
    Multimedia Systems, 2022, 28 : 1723 - 1739
  • [36] Performance analysis of multiview video compression based on MIV and VVC multilayer
    Lee, Jinho
    Bang, Gun
    Kang, Jungwon
    Teratani, Mehrdad
    Lafruit, Gauthier
    Choi, Haechul
    ETRI JOURNAL, 2024, 46 (06) : 1075 - 1089
  • [37] A neuro-fuzzy QP estimation approach for H.266/VVC-based live video broadcasting systems
    Raufmehr, Farhad
    Salehi, Mohammad Reza
    Abiri, Ebrahim
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 56423 - 56443
  • [38] Overcoming the practical restrictions in H.266/VVC-based video communication systems by a PI bit rate controller
    Raufmehr, Farhad
    Salehi, Mohammad Reza
    Abiri, Ebrahim
    MULTIMEDIA SYSTEMS, 2022, 28 (05) : 1723 - 1739
  • [39] CNN Based Image Restoration Adjusting Ill-Exposed sRGB Images in Post-Processing
    Steffens, Cristiano R.
    Messias, Lucas R., V
    Drews-Jr, Paulo J. L.
    Botelho, Silvia S. D. C.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2020, 99 (3-4) : 609 - 627
  • [40] MFRNet: A New CNN Architecture for Post-Processing and In-loop Filtering
    Ma, Di
    Zhang, Fan
    Bull, David R.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 378 - 387