Enhanced Machine Learning-based Inter Coding for VVC

Cited by: 4
Authors
Benjak, Martin [1]
Meuel, Holger [1]
Laude, Thorsten [1]
Ostermann, Jorn [1]
Affiliations
[1] Leibniz Univ Hannover, Inst Informat Verarbeitung, Hannover, Germany
Keywords
VVC; inter coding; video coding; machine learning; recurrent neural networks
DOI
10.1109/ICAIIC51459.2021.9415184
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture buffer are processed by a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained with a sum of absolute transformed differences (SATD) cost function, so that it minimizes the bit rate cost of the prediction error rather than the pixel-wise difference. With this approach, we achieved average weighted BD-rate gains of 0.94%. The coding time increased by about 5% for the encoder and by about 300% for the decoder because of the neural network.
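As a rough illustration of the SATD measure named in the abstract, the following is a minimal NumPy sketch, not the authors' training loss (which would have to be written in a differentiable framework). The function names `hadamard` and `satd`, the 4x4 block size, and the lack of normalization are assumptions made for this example only.

```python
import numpy as np


def hadamard(n):
    """Build an n x n Hadamard matrix by Sylvester's construction (n a power of two)."""
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    h = np.array([[1]], dtype=np.int64)
    while h.shape[0] < n:
        h = np.block([[h, h], [h, -h]])
    return h


def satd(original, prediction, block=4):
    """Sum of absolute transformed differences over non-overlapping blocks.

    Assumption for this sketch: 2-D sample arrays whose height and width are
    multiples of `block`, and an unnormalized Hadamard transform.
    """
    orig = np.asarray(original, dtype=np.int64)
    pred = np.asarray(prediction, dtype=np.int64)
    if orig.shape != pred.shape:
        raise ValueError("original and prediction must have the same shape")
    rows, cols = orig.shape
    if rows % block or cols % block:
        raise ValueError("picture dimensions must be multiples of the block size")

    h = hadamard(block)
    residual = orig - pred  # prediction error
    total = 0
    for y in range(0, rows, block):
        for x in range(0, cols, block):
            r = residual[y:y + block, x:x + block]
            # 2-D Hadamard transform of the residual block, then sum of magnitudes
            total += int(np.abs(h @ r @ h.T).sum())
    return total
```

With these assumptions, a 4x4 block with a flat residual of 1 and a 4x4 block with a single-pixel residual of 1 both give an SATD of 16, whereas their plain sums of absolute differences are 16 and 1. This shows how the transform-domain measure weights the residual differently from a pixel-wise difference.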
Pages: 21-25
Page count: 5
Related papers
50 entries in total
  • [21] Adaptive Deep Reinforcement Learning-Based In-Loop Filter for VVC
    Huang, Zhijie
    Sun, Jun
    Guo, Xiaopeng
    Shang, Mingyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5439 - 5451
  • [22] Machine Learning-Based Task Clustering for Enhanced Virtual Machine Utilization in Edge Computing
    Alnoman, Ali
    2020 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2020,
  • [23] Machine Learning-based Fast Intra Coding Unit Depth Decision for High Efficiency Video Coding
    Chen, Zong-Yi
    Fang, Jiunn-Tsair
    Liu, Yen-Chun
    Chang, Pao-Chi
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (05) : 1289 - 1299
  • [24] Machine Learning-Based Feature Mapping for Enhanced Understanding of the Housing Market
    Lystbaek, Michael Sahl
    Srirajan, Tharsika Pakeerathan
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 : 530 - 543
  • [25] STATISTICAL ANALYSIS OF INTER CODING IN VVC TEST MODEL (VTM)
    Liu, Yiqun
    Abdoli, Mohsen
    Guionnet, Thomas
    Guillemot, Christine
    Roumy, Aline
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3456 - 3459
  • [26] Software-Defined IoT with Machine Learning-Based Enhanced Security
    Husnain, Ali
    Nguyen, Chau
    Le, Ngoc Thuy
    2023 28TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS, APCC 2023, 2023, : 430 - 435
  • [27] An Enhanced Model for Machine Learning-Based DoS Detection in Vehicular Networks
    Ercan, Secil
    Mendiboure, Leo
    Alouache, Lylia
    Maaloul, Sassi
    Sylla, Tidiane
    Aniss, Hasnaa
    2023 IFIP NETWORKING CONFERENCE, IFIP NETWORKING, 2023,
  • [28] Learning-based encoder algorithms for VVC in the context of the optimized VVenC implementation
    Tech, Gerhard
    George, Valeri
    Pfaff, Jonathan
    Wieckowski, Adam
    Bross, Benjamin
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [29] Switchable Motion Models for Non-Block-Based Inter Prediction in Learning-Based Video Coding
    Brand, Fabian
    Seiler, Jurgen
    Kaup, Andre
    2021 PICTURE CODING SYMPOSIUM (PCS), 2021, : 161 - 165
  • [30] Learning-based Multiview Video Coding
    Bai, Baochun
    Cheng, Li
    Lei, Cheng
    Boulanger, Pierre
    Harms, Janelle
    PCS: 2009 PICTURE CODING SYMPOSIUM, 2009, : 201 - +