A convolutional neural network-based rate control algorithm for VVC intra coding

被引:2
|
作者
Wang, Jiafeng [1 ]
Shang, Xiwu [1 ]
Zhao, Xiaoli [1 ]
Zhang, Yuhuai [2 ]
机构
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Peking Univ, Inst Digital Media, Dept Elect Engn & Comp Sci, Beijing 100000, Peoples R China
基金
中国国家自然科学基金;
关键词
H.266/VVC; Intra coding; Rate control; Convolutional Neural Network (CNN); BLIND QUALITY ASSESSMENT; VIDEO;
D O I
10.1016/j.displa.2024.102652
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Versatile Video Coding (VVC) has shown significant improvements in Rate-Distortion (R-D) performance compared to its predecessor, High Efficiency Video Coding (HEVC). However, it still encounters several challenges. One of these challenges is the efficient allocation of bits among all Coding Tree Units (CTUs). Additionally, there is a lack of prior information for intra-frame coding, particularly for the first frame. After CTU-level bit allocation, only fixed parameters can be used to determine the lambda for CTUs, which does not result in optimal ratedistortion performance. To tackle above challenges, we propose a rate control solution based on Convolutional Neural Network (CNN). This approach utilizes CNN to predict the key parameters alpha and beta in the R-D model, addressing the problem of lacking prior information in intra-frame coding. Subsequently, the predicted alpha and beta values are used to adaptively allocate bits for each CTU. Our proposed algorithm is implemented in VTM-16.0 under Common Test Conditions (CTC). Experimental results show that, compared to the default rate control algorithm in VTM-16.0, our proposed algorithm enhances R-D performance by 0.96% while maintaining rate control accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] IMAGE CODING WITH NEURAL NETWORK-BASED COLORIZATION
    Lopes, Diogo
    Ascenso, Joao
    Brites, Catarina
    Pereira, Fernando
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4225 - 4229
  • [42] Neural Network Based Rate Control for Versatile Video Coding
    Mao, Yunhao
    Wang, Meng
    Ni, Zhangkai
    Wang, Shiqi
    Kwong, Sam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 6072 - 6085
  • [43] A CONVOLUTIONAL NEURAL NETWORK-BASED MODEL OF NEURAL PATHWAYS IN THE RETINA
    Zamani, Yasin
    Nategh, Neda
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 6906 - 6909
  • [44] Convolutional Neural Network-Based Robot Control for an Eye-in-Hand Camera
    Guo, Jia
    Nguyen, Huu-Thiet
    Liu, Chao
    Cheah, Chien Chern
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (08): : 4764 - 4775
  • [45] Convolutional Neural Network-Based Spacecraft Attitude Control for Docking Port Alignment
    Kim, Sang-Hyeon
    Choi, Han-Lim
    2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 484 - 489
  • [46] Fast CU size decision algorithm for VVC intra coding
    Xiwu Shang
    Guoping Li
    Xiaoli Zhao
    Hua Han
    Yifan Zuo
    Multimedia Tools and Applications, 2023, 82 : 28301 - 28322
  • [47] Convolutional neural network-based phantom image scoring for mammography quality control
    Sundell, Veli-Matti
    Maekelae, Teemu
    Vitikainen, Anne-Mari
    Kaasalainen, Touko
    BMC MEDICAL IMAGING, 2022, 22 (01)
  • [48] CONVOLUTIONAL NEURAL NETWORK-BASED FRACTAL CODING METHOD FOR IMAGE TRANSLATION IN MULTIMODAL CHANGE DETECTION
    Radoi, Anamaria
    Unsalan, Melisa
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1063 - 1066
  • [49] CONVOLUTIONAL NEURAL NETWORK-BASED INVERTIBLE HALF-PIXEL INTERPOLATION FILTER FOR VIDEO CODING
    Yan, Ning
    Liu, Dong
    Li, Bin
    Li, Houqiang
    Xu, Tong
    Wu, Feng
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 201 - 205
  • [50] Fast CU size decision algorithm for VVC intra coding
    Shang, Xiwu
    Li, Guoping
    Zhao, Xiaoli
    Han, Hua
    Zuo, Yifan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (18) : 28301 - 28322