Deep Reference Frame Generation Method for VVC Inter Prediction Enhancement

Cited by: 2
Authors
Jia, Jianghao [1 ]
Zhang, Yuantong [1 ]
Zhu, Han [1 ]
Chen, Zhenzhong [1 ]
Liu, Zizheng [2 ]
Xu, Xiaozhong [3 ]
Liu, Shan [3 ]
Affiliations
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Peoples R China
[2] Tencent Shenzhen, Shenzhen 518000, Peoples R China
[3] Tencent Amer, Palo Alto, CA 94306 USA
Keywords
Interpolation; Optical flow; Extrapolation; Bidirectional control; Kernel; Encoding; Streaming media; Neural-network-based video coding; versatile video coding (VVC); inter prediction; deep learning; NETWORK;
DOI
10.1109/TCSVT.2023.3299410
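Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808 ; 0809 ;
Abstract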
In video coding, inter prediction aims to reduce temporal redundancy by using previously encoded frames as references. The quality of reference frames is crucial to the performance of inter prediction. This paper presents a deep reference frame generation method to optimize inter prediction in Versatile Video Coding (VVC). Specifically, reconstructed frames are sent to a well-designed frame generation network to synthesize a picture similar to the frame currently being encoded. The synthesized picture serves as an additional reference frame inserted into the reference picture list (RPL) to provide a more reliable reference for subsequent motion estimation (ME) and motion compensation (MC). The frame generation network employs optical flow to predict motion precisely. Moreover, an optical flow reorganization strategy is proposed to enable bi-directional and uni-directional predictions with only a single network architecture. To reasonably apply our method to VVC, we further introduce a normative modification of the temporal motion vector prediction (TMVP). Integrated into the VVC reference software VTM-15.0, the deep reference frame generation method achieves coding efficiency improvements of 5.22%, 3.61%, and 3.83% for the Y component under random access (RA), low delay B (LDB), and low delay P (LDP) configurations, respectively. The proposed method has been discussed at a Joint Video Exploration Team (JVET) meeting and is currently part of the Exploration Experiments (EE) for further study.
Pages: 3111-3124
Page count: 14
Related papers
50 in total
  • [31] Quality enhancement of VVC intra-frame coding for multimedia services over the Internet
    Cho, Seunghyun
    Kim, Dong-Wook
    Jung, Seung-Won
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2020, 16 (05)
  • [32] Enhanced Motion-Compensated Video Coding With Deep Virtual Reference Frame Generation
    Zhao, Lei
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (10) : 4832 - 4844
  • [33] Learning-Based Complexity Reduction Scheme for VVC Intra-Frame Prediction
    Saldanha, Mario
    Sanchez, Gustavo
    Marcon, Cesar
    Agostini, Luciano
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [34] AUTOMATIC ENHANCEMENT OF THE REFERENCE SET FOR MULTI-CRITERIA SORTING IN THE FRAME OF THESEUS METHOD
    Fernandez, Eduardo
    Navarro, Jorge
    Salomon, Eduardo
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2014, 39 (02) : 57 - 77
  • [35] Fast reference frame and inter-mode selection method for H.264/AVC
    Kim, Hyungwook
    Lim, Sojeong
    Koo, Namhoon
    Yu, Sungwook
    SIGNAL IMAGE AND VIDEO PROCESSING, 2014, 8 (06) : 1087 - 1093
  • [36] Fast reference frame and inter-mode selection method for H.264/AVC
    Hyungwook Kim
    Sojeong Lim
    Namhoon Koo
    Sungwook Yu
    Signal, Image and Video Processing, 2014, 8 : 1087 - 1093
  • [37] Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding
    Feng, Aolin
    Liu, Kang
    Liu, Dong
    Li, Li
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2237 - 2251
  • [38] A CNN-Based Prediction-Aware Quality Enhancement Framework for VVC
    Nasiri, Fatemeh
    Hamidouche, Wassim
    Morin, Luce
    Dhollande, Nicolas
    Cocherel, Gildas
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2021, 2 : 466 - 483
  • [39] Inter-Frame Dependency-Based Rate Control for VVC Low-Delay Coding
    Liu, Hewei
    Zhu, Shuyuan
    Zeng, Bing
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2727 - 2731
  • [40] Inter Cross-Component Prediction Merge Mode for Video Coding beyond VVC
    Deng, Zhipin
    Zhang, Kai
    Zhang, Li
    2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 551 - 551